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PREFACE 

Throughout the decade of the 1990’s, I taught a one-year course of a specialized nature to 
students who entered Yale College with excellent preparation in Mathematics and the Physical 
Sciences, and who expressed an interest in Physics or a closely related field. The level of the course 
was that typified by the Feynman Lectures on Physics. My one-year course was necessarily more 
restricted in content than the two-year Feynman Lectures. The depth of treatment of each topic was 
limited by the fact that the course consisted of a total of fifty-two lectures, each lasting one-and-a-quarter 
hours. The key role played by invariants in the Physical Universe was constantly emphasized. The 
material that I covered each Fall is presented, almost verbatim, in this book. 

The first chapter contains key mathematical ideas, including some invariants of geometry and 
algebra, generalized coordinates, and the algebra and geometry of vectors. The importance of linear 
operators and their matrix representations is stressed in the early lectures. These mathematical 
concepts are required in the presentation of a unified treatment of both Classical and Special Relativity. 
Students are encouraged to develop a “relativistic outlook’ at an early stage. The fundamental Lorentz 
transformation is developed using arguments based on symmetrizing the classical Galilean 
transformation. Key 4-vectors, such as the 4-velocity and 4-momentum, and their invariant norms, are 
shown to evolve in a natural way from their classical forms. A basic change in the subject matter 
occurs at this point in the book. It is necessary to introduce the Newtonian concepts of mass, 
momentum, and energy, and to discuss the conservation laws of linear and angular momentum, and 
mechanical energy, and their associated invariants. The discovery of these laws, and their applications 
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to everyday problems, represents the high point in the scientific endeavor of the 17th and 18th 
centuries. An introduction to the general dynamical methods of Lagrange and Hamilton is delayed until 
Chapter 9, where they are included in a discussion of the Calculus of Variations. The key subject of 
Einsteinian dynamics is treated at a level not usually met in at the introductory level. The 4momentum 
invariant and its uses in relativistic collisions, both elastic and inelastic, is discussed in detail in Chapter6. 
Further developments in the use of relativistic invariants are given in the discussion of the Mandelstam 
variables, and their application to the study of high-energy collisions. Following an overview of 
Newtonian Gravitation, the general problem of central orbits is discussed using the powerful method of 
[p, r] coordinates. Einstein’s General Theory of Relativity is introduced using the Principle of 
Equivalence and the notion of “extended inertial frames” that include those frames in free fall in a 
gravitational field of small size in which there is no measurable field gradient. A heuristic argument is 
given to deduce the Schwarzschild line element in the “weak field approximation”; it is used as a basis 
for a discussion of the refractive index of space-time in the presence of matter. Einstein’s famous 
predicted value for the bending of a beam of light grazing the surface of the Sun is calculated. The 
Calculus of Variations is an important topic in Physics and Mathematics; it is introduced in Chapter 9, 
where it is shown to lead to the ideas of the Lagrange and Hamilton functions. These functions are 
used to illustrate in a general way the conservation laws of momentum and angular momentum, and 
the relation of these laws to the homogeneity and isotropy of space. The subject of chaos is introduced 
by considering the motion of a damped, driven pendulum. A method for solving the non-linear 
equation of motion of the pendulum is outlined. Wave motion is treated from the point-of-view of 
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invariance principles. Tine form of the general wave equation is derived, and the Lorentz invariance of 
the phase of a wave is discussed in Chapter 12 . The final chapter deals with the problem of orthogonal 
functions in general, and Fourier series, in particular. At this stage in their training, students are often 
under-prepared in the subject of Differential Equations. Some useful methods of solving ordinary 
differential equations are therefore given in an appendix. 

The students taking my course were generally required to take a parallel one-year course in 
the Mathematics Department that covered Vector and Matrix Algebra and Analysis at a level suitable 
for potential majors in Mathematics. 

Here, I have presented my version of a first-semester course in Physics — a version that deals 
with the essentials in a no-frills way. Over the years, I demonstrated that the contents of this compact 
book could be successfully taught in one semester. Textbooks are concerned with taking many 
known facts and presenting them in clear and concise ways; my understanding of the facts is largely 
based on the writings of a relatively small number of celebrated authors whose work I am pleased to 
acknowledge in the bibliography. 

Guilford, Connecticut 

February, 2000 

I am grateful to several readers for pointing out errors and unclear statements in my first version of this 
book. Tine comments of Dr Andre Mirabelli were particularly useful, and were taken to heart. 

March, 2003 
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1 

MATHEMATICAL PRELIMINARIES 

1.1 Invariants 

It is a remarkable fact that very few fundamental laws are required to describe the enormous 
range of physical phenomena that take place throughout the universe. The study of these 
fundamental laws is at the heart of Physics. The laws are found to have a mathematical structure; the 
interplay between Physics and Mathematics is therefore emphasized throughout this book. For 
example, Galileo found by observation, and Newton developed within a mathematical framework, the 
Principle of Relativity: 

the laws governing the motions of objects have the same mathematical form in all inertial 

frames of reference. 

Inertial frames move at constant speed in straight lines with respect to each other- they are mutually 
non-accelerating. We say that Newton’s laws of motion are invariant under the Galilean transformation 
(see later discussion). The discovery of key invariants of Nature has been essential for the 
development of the subject. 

Einstein extended the Newtonian Principle of Relativity to include the motions of beams of light 
and of objects that move at speeds close to the speed of light. This extended principle forms the basis 
of Special Relativity. Later, Einstein generalized the principle to include accelerating frames of 
reference. The general principle is kncwn as the Principle of Covariance; it forms the basis of the 
General Theory of Relativity (a theory of Gravitation). 
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A review of the elementary properties of geometrical invariants, generalized coordinates, linear 
vector spaces, and matrix operators, is gven at a level suitable for a sound treatment of Classical and 
Special Relativity. Other mathematical methods, including contra- and covariant 4-vectors, variational 
principles, orthogonal functions, and ordinary differential equations are introduced, asrequired. 

12 Some geometrical invariants 

In his book The Ascent of Man, Bronowski discusses the lasting importance of the discoveries 
of the Greek geometers. He gives a proof of the most famous theorem of Euclidean Geometry, 
namely Pythagoras’ theorem, that is based on the invariance of length and angle (and therefore of 
area) under translations and rotations in space. Let a right-angled triangle with sides a, b, and c, be 
translated and rotated into the following four positions to form a square ofsicb c: 



h- (b-a) H 


The total area of the square = c? = area of four triangles + area of shaded square. 
If the right-angled triangle is translated and rotated to form the rectangle: 
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a a 



then the area of four triangles = 2ab. 

Tine area of the shaded square area is (b- a) 2 = b 2 - 2ab + a 2 

We have postulated the invariance of length and angle under translations and rotations and therefore 
c2=2ab + (b-a) 2 

= a 2 +b2. (1.1) 

We shall see that this key result characterizes the locally flat space in which we live. It is the only form 
that is consistent with the invariance of lengths and angles under translates and rotations . 

The scalar product is an important invariant in Mathematics and Physics. Its invariance properties can 
best be seen by developing Pythagoras’ theorem in a threeclimensional coordinate form. Consider 
the square of the distance between the points P [xi , yi , zi] and Q [X 2 , y 2 , Z 2 ] in Cartesian coordinates: 
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We have 

(PQ)2 = (X2-Xl)2 + (y2-yi)2 + (Z2-Zl)2 


= X2 2 - 2x1X2 + xi 2 + Y2 2 — 2yiy2 + yi 2 + Z2 2 - 2ziZ 2 + zi 2 
= (xi 2 + yi 2 + zi 2 ) + (X2 2 + Y2 2 + Z2 2 ) ■ - 2 (xiX 2 + yiy2 + Z1Z2) 

= (OP f + (OQ f -2(xiX2+yiy 2 +ziZ2) (1.2) 

Tine lengths PQ, OP, OQ, and their squares, are invariants under rotations and therefore the entire 
right-hand side of this equation is an invariant. Tine admixture of the coordinates (X1X2 + yiy2 + Z1Z2) is 
therefore an invariant under rotations. This term has a geometric interpretation: in the triangle OPQ, we 
have the generalized Pythagorean theorem 


(PQ) 2 = (OP) 2 + (OQ) 2 - 20P.0Q cosa, 


therefore 


OP.OQ cosa = X 1 X 2 +yiy 2 + Z 1 Z 2 = the scalar product. 


(1.3) 
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Invariants in space-time with scalar-product-like forms, such as the interval between events 
(see 3.3), are of fundamental importance in the Theory of Relativity. Although rotations in space are 
part of our everyday experience, the idea of rotations in space-time is counter-intuitive. In Chapter 3, 
this idea is discussed in terms of he relative motion of inertial observers. 

1 .3 Elements of differential geometry 

Nature does not prescibe a particular coordinate system or mesh. We are free to select the 
system that is most appropriate for the problem at hand. In the familiar Cartesian system in which the 
mesh lines are orthogonal, equidistant, straight lines in the plane, the key advantage stems from our 
ability to calculate distances given the coordinates - we can apply Pythagoras’ theorem, directly. 
Consider an arbitrary mesh: 
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Given the point P [3 U , 4v], we cannot use Pythagoras’ theorem to calculate the distance OP. 

In the infinitesimal parallelogram shown, we might think it appropriate to write 
ds 2 = du 2 + dv 2 + 2dudvcosa . (ds 2 = (ds) 2 , a squared “length” ) 

This we cannot do! The differentials du and dv are not lengths - they are simply differences between 
two numbers that label the mesh. We must therefore multiply each differential by a quantity that 
converts each one into a length. Introducing dimensioned coefficients, we have 

ds 2 = gndu 2 +2gi2dudv+g22dv 2 (1.4) 

where Vgn du and Vg^ dv are now lengths. 

fine problem is therefore one of finding general expressions for the coefficients; 
it was solved by Gauss, the pre-eminent mathematician of his age. We shall restrict our discussion to 
the case of two variables. Before treating this problem, it will be useful to recall the idea of a total 
differential associated with a function of more than one variable. 

Let u = f(x, y) be a function of two variables, x and y. As x and y vary, the corresponding values of u 
describe a surface. For example, if u = x 2 + y 2 , the surface is a paraboloid of revolution. The partial 
derivatives of u are defined by 

af(x, y)/ax = limit as h -*0 {(f(x + h, y) - f(x, y))/h} (treat y as a constant), (1 .5) 

and 

af(x, y)/ay = limit as k -*0 {(f(x, y + k) - f(x, y))/k} (treat x as a constant). (1 .6) 

For example, if u = f(x, y) = 3x 2 + 2y 3 then 


af/ax = 6x, aw = 6, aT/ax 3 = o 
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and 

df/dy = 6y 2 , c)2f/c)y 2 = 12y, ri^/riy 3 = 12, and rTfdy 4 = 0. 

If u = f(x, y) then the total differential of the function is 
du = (df/dx)dx+(df/dy)dy 

corresponding to the changes: x -»> x + dx and y -> y + dy. 

(Note that du is a function of x, y, dx, and dy of the independent variables x and y) 

1 A Gaussian coordinates and the invariant line element 

Consider the infinitesimal separation between two points P and Q that are described in either 
Cartesian or Gaussian coordinates: 

A 

y+dy v 

, tis 

y P^- ^ v P 

x x+dx u u+du 

Cartesian Gaussian 

In the Gaussian system, du and dv do not represent distances. 

Let 

x=f(u,v)andy=F(u,v) (1 .7 a,b) 

then, in the infinitesimal limit 

dx = (dx/du)du + (dx/dv)dv and dy = (dy/du)du + (dy/dv)dv. 
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In the Cartesian system, there is a direct correspondence between the mesh-numbers and distances : 
ds 2 = dx 2 +dy 2 . (1.8) 

But 

dx 2 = (dx/d u)2du 2 + 2(ax/au)(ax/av)dudv + (ax/av)W and 
dy 2 = (dy/du)2du 2 + 2(ay/au)(ay/av)dudv + (ay/av)W. 

We therefore obtain 

ds 2 = {(ax/a uf + (ay/au)2}du 2 + 2{(ax/au)(ax/av) + (ay/au)(ay/av)}dudv 
+{(ax/av) 2 +(ay/av)2}dv 2 

= gi i du 2 + 2gi2dudv + g 22 dv 2 . (1 .9) 

If we put u = ui and v = U 2 , then 

ds 2 = E E9 udu idu j where ij = 1,2, (ageneral form in n-dimensional space: i, j = 1, 2, 3, ...n) (1.10) 

i j 

Two important points connected with Ms invariant differential line element are: 

1. Interpretation of the coefficients gij : consider a Euclidean mesh of equispaced parallelograms: 



InPQR 


ds 2 = 1 .du 2 + 1 .dv 2 + 2cosadudv 
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= gndu 2 +g 22 dv 2 + 2 gi 2 dudv (1.11) 

therefore, gn = g^ = 1 (the mesh-lines are equispaced) 
and 

gi 2 = cosa where a is the angle between the u-v axes. 

We see that if the mesh-lines are locally orthogonal then gi 2 = 0. 

2. Dependence of the gft on the coordinate system and the local values ofu, v. 

A specific example will illustrate the main points of this topic: consider a point P described in 
three coordinate systems - Cartesian P [x, y], Polar P [r, <j>], and Gaussian P [u, v] - and the square 
ds 2 of the line element in each system. 

The transformation [x, y] -»> [r, $ is 

x = ncoscj) and y = rsincj). (1.12 a,b) 

The transformation [r, <f)] -> [u, v] is direct, namely 
r=uand(i)=v. 

Now, 

ax/a r=cos4>, ay/ar = sintj), ax/acj) = — nsin(|), ay/acj) = rcos4> 
therefore, 

ax/a u = cosv, ay/au = sinv, ax/av = -usinv, ay/av = ucosv. 

Tie coefficients are therefore 


gn = cos2v + sin2v = 1 , 
g 22 = (-usinv) 2 +(ucosv)2 = u 2 , 


(1.13 a-c) 
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and 

gi 2 = cos(-usinv) + sinv(ucosv) = 0 (an orthogonal mesh). 

We therefore have 

ds 2 = dx 2 +dy 2 (1.14a-c) 

= du 2 + u 2 dv 2 
= dr 2 +r 2 d()) 2 . 

In this example, the coefficient = f(u). 

The essential point of Gaussian coordinate systems is that the coefficients g ij completely 
characterize the surface - they are intrinsic features. We can, in principle, determine the nature of a 
surface by measuring the local values of the coefficients as we move over the surface. We do not 
need to leave a surface to study its form. 

1 .5 Geometry and groups 

Felix Klein (1849-1925), introduced his influential Erianger Program in 1872. In this program, 
Geometry is developed from the viewpoint of the invariants associated with groups of transformations. 
In Euclidean Geometry, the fundamental objects are taken to be rigid bodies that remain fixed in size 
and shape as they are moved from place to place. The notion of a rigid body is an idealization. 

Klein considered transformations of the entire plane - mappings of the set of all points in the 
plane onto itself. The proper set of rigid motions in the plane consists of translations and rotations. A 
reflection is an improper rigid motion in the plane; it is a physical impossibility in the plane itself. The set 
of all rigid motions - both proper and improper - forms a group that has the proper rigid motions as a 
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subgroup. A group G is a set of distinct elements {g} for which a law of composition “ • ” is given such 
that the composition of any two elements of the set satisfies: 

Closure : if g ,, g j belong to G then g k = g i ° g j belongs to G for all elements g i, g j , 
and 

Associativity, for all g i, g j, g k in G, gi°(gj°g k ) = (g i»g j)»g k. . 

Furthermore, the set contains 

A unique identity, e, such that g i e = e g , = g i for all g , in G, 
and 

A unique inverse, g r 1 , for every element g in G, 
such that g i ° g r 1 = g r 1 ° g i = e. 

A group that contains a finite number n of distinct elements g n is said to be a finite group of order n. 

fine set of integers Z is a subset of the reals R; both sets form infinite groups under the 
composition of addition. Z is a “subgroupfof R. 

Permutations of a set X form a group Sx under composition of functions; if a: X -> X and b: X 
X are permutations, the composite function ab: X -» X given by ab(x) = a(b(x)) is a permutation. If 
the set X contains the first n positive numbers, the n! permutations form a group, the symmetric group, 
S n . For example, the arrangements of the three numbers 1 23 form the group 


S 3 = { 1 23, 31 2, 231 , 1 32, 321 , 21 3 }. 
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If the vertices of an equilateral triangle are labelled 123, the six possible symmetry 
arrangements of the triangle are obtained by three successive rotations through 1 20 about its center of 
gravity, and by the three reflections in the planes I, II, III: 



II III 

This group of “isometries" of the equilateral triangle (called the dihedral group, Ds) has the same 
structure as the group of permutations of three objects. The groups & and D 3 are said to be 
isomorphic. 

According to Klein, plane Euclidean Geometry is the study of those properties of plane rigid 
figures that are unchanged by the group of isometries. (The basic invariants are length and angle). In 
his development of the subject, Klein considered Similarity Geometry that involves isometries with a 
change of scale, (the basic invariant is angle), Affine Geometry, in which figures can be distorted under 
transformations of the form 


x' = ax+by+c 
y' = dx+ey+f, 


(1.15a,b) 
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where [x, y] are Cartesian coordinates, and a, b, c, d, e, f, are real coefficients, and Projective 
Geometiy, in which all conic sections: circles, ellipses, parabolas, and hyperbolas can be transformed 
into one another by a projective transformation. 

It will be shown that the Lorentz transformations - the fundamental transformations of events in space 
and time, as described by different inertial observers- form a group. 

1.6 Vectors 

The idea that a line with a definite length and a definite direction — a vector— can be used to 
represent a physical quantity that possesses magnitude and direction is an ancient one. The 
combined action of two vectors A and B is obtained by means of the parallelogram law, illustrated h 
the following diagram 



The diagonal of the parallelogram formed by A and B gives the magnitude and direction of the 
resultant vector C. Symbolically, we write 

C=A+B (1.16) 

in which the “=” sign has a meaning that is clearly different from its meaning in ordinary arithmetic. 
Galileo used this empirically-based law to obtain the resultant force acting on a body. Although a 
geometric approach to the study of vectors has an intuitive appeal, it will often be advantageous b use 
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the algebraic method - particularly in the study of Einstein’s Special Relativity and Maxwell’s 
Electromagnetism. 

1.7 Quaternions 

In the decade 1830-1 840, the renowned Hamilton introduced new kinds of 
numbers that contain four components, and that do not obey the commutative property of 
multiplication. He called the new numbers quaternions. A quaternion has the form 

u+xi+yj+zk (1.17) 

in which the quantities i, j, k are akin to the quantity i = V-1 in complex numbers, x + iy. fine 
component u forms the scalar part, and the three components xi + yj + zk form the vector part of the 
quaternion. The coefficients x, y, z can be considered to be the Cartesian components of a point P in 
space, fine quantities i, j, k are qualitative units that are directed along the coordinate axes. Two 
quaternions are equal if their scalar parts are equal, and if their coefficients x, y, z of i, j, k are 
respectively equal. The sum of two quaternions is a quaternion. In operations that involve quaternions, 
the usual rules of multiplication hold except in those terms in which products of i, j, k occur— in these 
terms, the commutative law does not hold. For example 

j k= i, kj =— i, ki=j, ik=-j, ipk, ji=-k, (1.18) 

(these products obey a right-hand rule), 
and 

i 2 = j 2 = k 2 = -1 . (Note the relation to i 2 =— 1). (1.19) 

fine product of two quaternions does not commute. For example, if 
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p = 1 + 2i + 3j + 4k, and q = 2 + 3i + 4j + 5k 
then 

pq = — 36 + 6i + 1 2j + 1 2k 
whereas 

qp = — 36 + 23i - 2j + 9k. 

Multiplication is associative. 

Quaternions can be used as operators to rotate and scale a given vector into a new vector: 

(a + bi + cj + dk)(xi + yj + zk) = (x'i + y j + z'k) 

If the law of composition is quatemionic multiplication then the set 
Q={±1,±i,±j, ±k} 

is found to be a group of order 8. It is a norvcommutative group. 

Hamilton developed the Calculus of Quaternions. He considered, for example, the properties 
of the differential operator: 

V = i(d/3x) + j(3/dy) + k(d/3z). (1.20) 

(He called this operator “nabla”). 

If f(x, y, z) is a scalar point function (single-valued) then 
Vf = p/3x) +p/3y) + k(df/az) , a vector. 

If 


V = Vii + V2j + V3k 
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is a continuous vector point function, where the v’s are functions of x, y, and z, Hamilton introduced the 
operation 

Vv = (id/3x + j3/3y + kd/dz)(vii + V 2 j + V 3 k) (1 .21 ) 

= - (dvi/3x + <9v2/<9y + dv^Jdz) 

+ (dv 3 /Sy - 5v2/<9z)i + (3vi/dz - dv 3 /Sx)j + (Sv^dx - 3vi/3y)k 
= a quaternion. 

fine scalar part is the negative of the “divergence of v” (a term due to Clifford), and the vector part is the 
“curt of v” (a term due to Maxwell). Maxwell used the repeated operator V 2 , which he called the 
Laplacian. 

1.8 3 -vector analysis 

Gibbs, in his notes for Yale students, written in the period 1881 - 1884, and Heaviside, in articles 
published in the Electrician in the 1880’s, independently developed ^dimensional Vector Analysis as a 
subject in its own right— detached from quaternions. 

In the Sciences, and in parts of Mathematics (most notably in Analytical and Differential Geometry), 
their methods are widely used. Two kinds of vector multiplication were introduced: scalar multiplication 
and vector multiplication. Considertwo vectors v and v' where 
v=viei+V2e2+V3e3 
and 


v'=vi'ei+V2'e2+V3'e3. 
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Tine quantities ei, 62 , and e 3 are vectors of unit length pointing along mutually orthogonal axes, labeled 
1,2, and 3. 

i) The scalar multiplication of v and v' is defined as 

v • v' = ViVi ' + V2V2' + V3V3', ( 1 . 22 ) 

where the unit vectors have the properties 

erei = 62 - 62 = 63 - 63 = 1 , (1.23) 

and 

ere2=e2-ei=ere3=e3-ei=e2-e3=e3-e2=0. (1.24) 

The most important property of the scalar product of two vectors is its invariance under 
rotations and translations of the coordinates. (See Chapterl). 

ii) Tine vector product of two vectors v and v' is defined as 

ei e 2 e3 

vxv'= vi V 2 V 3 (where |. .. |is the determinant) (1.25) 

Vi' V2' V3' 

= (V2 V3' - V3V2>1 + (V3V1 ' - VlV3')e2 + (V1V2' - V2V1 >3 . 

Tie unit vectors have the properties 

eixei = e2xe2=e3xe3=0 (1.26a, b) 

(note that these properties differ from the quatemionic products of the i, j, k’s), 
and 

eixe2=e3,e2xei=-e3,e2xe3=ei,e3xe2=-ei,e3xei=e2,eixe3=-e2 
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These non-commuting vectors, or “cross products” obey the standard right-hand-rule. 

The vector product of two parallel vectors is zero even when neither vector is zero. 

Tie non-associative property of a vector product is illustrated in the following example 
ei xe 2 xe 2 =(ei xe 2 )xe 2 =e 3 xe 2 =-ei 
=eix(e 2 xe 2 ) = 0 . 

Important operations in Vector Analysis that follow directly from those introduced in the theory 
of quaternions are: 

1 ) the gradient of a scalar function f(xi , X 2 , X 3 ) 

Vf = (5f/5xi)ei + (5f/5x 2 )e 2 + ( 5 f/ 5 x 3 )e 3 , (1 .27) 

2 ) the divergence of a vector function v 

V • v = dvi/dxi + 5 v 2 / 5 x 2 + 5 v 3 / 5 x 3 (1 .28) 

where v has components vi, v 2 , V 3 that are functions of xi, X 2 , X 3 , and 

3) the curl of a vector function v 


ei e 2 e3 


Vxv- 


5/5x 1 < 9 / 5 x 2 5 / 5 x 3 . 


Vi V2 V3 


(1.29) 


The physical significance of these operations is discussed later. 

1 .9 Linear algebra and n-vectors 

A major part of Linear Algebra is concerned with the extension of the algebraic properties of 
vectors in the plane (2-vectors), and in space (3-vectors), to vectors in higher dimensions (rwectors). 
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This area of study has its origin in the work of Grassmann (1809 - 77), who generalized the quaternions 
(4-component hyper-complex numbers), introduced by Hamilton. 

An n-dimensional vector is defined as an ordered column of numbers 




Xn = 


Xi 

x> 


I X! J 


(1.30) 


It will be convenient to write this as an ordered row in square brackets 

Xn = [Xl,X2,...Xn]. (1.31) 

fine transpose of the column vector is the row vector 

Xn T = (Xl,X2,...Xn). (1.32) 

fine numbers xi, X 2 , ...Xn are called the components of x, and the integer n is the dimension of 
x. fine order of the components is important, for example 
[1,2, 3] *[2, 3,1], 

Tie two vectors x = [xi, X 2 , ...Xn] and y = [yi, y 2 , ...y n ] are equal if 
x=y (i = 1 to n). 

Tie laws of Vector Algebra are 

1. x+y=y+x. (1 .33 a-e) 

2. [x+y]+z=x+[y+z] . 

3. a[x + y] = ax + ay where a is a scalar . 

4. (a + b)x = ax + by where a,b are scalars . 
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5. (ab)x = a(bx) where a,b are scalars . 

If a = 1 and b = — 1 then 
x+[-x]=0, 

where 0 = [0, 0, ...0] is the zero vector. 

The vectors x = [xi, X 2 , ...Xn] and y = [yi,y 2 ...y n ] can be added to give their sum or resultant: 
x + y = [xi + yi, X 2 + y 2 , ...,Xn + y n ]. (1.34) 

The set of vectors that obeys the above rules is called the space of all rwectors or the vector 
space of dimension n. 

In general, a vector v = ax + by lies in the plane of x and y. The vector v is said to depend 
linearly on x and y — it is a linear combination of x and y. 

A k-vector v is said to depend linearly on the vectors m, 112 , ...Uk if there are scalars a such that 
v=aiUi+a2U2+...akUk. (1.35) 

For example 

[3, 5, 7] = [3, 6, 6] + [0, -1 , 1] = 3[1 , 2, 2] + 1 [0, -1 , 1], a linear combination of the vectors [1 , 2, 2] 
and [0-1,1], 

A set of vectors m, 112 , ...Uk is called linearly dependent if one of these vectors depends linearly 
on the rest. For example, if 

ui = a 2 ii 2 + a3ii3 + ...+ akUk., (1 .36) 


the set ui, ...Uk is linearly dependent. 
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If none of the vectors ui, 112, ...Uk can be written linearly in terms of the remaining ones we say 
that the vectors are linearly independent. 

Alternatively, the vectors m, 112, ...Ukare linearly dependent if and only if there is an equation of 

the form 

CiUl+C 2 ll 2 + ...CkUk = 0 , (1.37) 

in which the scalars q are not all zero. 

Consider the vectors e obtained by putting the ^component equal to 1, and all the other 
components equal to zero: 
ei = [1 , 0, 0, ...0] 
e2= [0,1,0, ...0] 

then every vector of dimension n depends linearly on ei, 62 , ...e n , thus 

X=[Xi,X2,...Xn] 

=xiei+X 2 e 2 +...Xne n . (1.38) 

fine er's are said to span the space of all n-vectors; they form a basis. Every basis of an n-space has 
exactly n elements, fine connection between a vector x and a definite coordinate system is made by 
choosing a set of basis vectors e . 

1 .1 0 The geometry of vectors 

The laws of vector algebra can be interpreted geometrically for vectors of dimension 2 and 3. 
Let the zero vector represent the origin of a coordinate system, and let the 2-vectors, x and y, 
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correspond to points in the plane: P [xi, X2] and Q |yi, y^]. The vector sum x + y is represented by the 
point R, as shown 



R is in the plane OPQ, even if x and y are 3-vectors. 

Every vector point on the line OR represents the sum of the two corresponding vector points on the 
lines OP and OQ. We therefore introduce the concept of the directed vector lines OP, OQ, and OR, 
related by the vector equation 

OP + OQ = OR . (1.39) 

A vector V can be represented as a line of length OP pointing in the direction of the unitvectorv, thus 



A vector V is unchanged by a pure displacement: 
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where the “=” sign means equality in magnitude and direction. 

Two classes of vectors will be met in future discussions; they are 

1. Polar vectors : the vector is drawn in the direction of the physical quantity being represented, for 
example a velocity, 

and 

2. Axial vectors, the vector is drawn parallel to the axis about which the physical quantity acts, for 
example an angular velocity. 

Tlie associative property of the sum of vectors can be readily demonstrated, geometrically 



We see that 

V=A + B + C = (A+B) + C=A+(B + C) = (A+C) + B. (1.40) 

Tine process of vector addition can be reversed; a vector V can be decomposed into the sum of n 
vectors of which (n - 1) are arbitrary, and the rf 1 vector closes the polygon. The vectors need not be in 
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the same plane. A special case of this process is the decomposition of a 3-vector into its Cartesian 
components. 


A general case A special case 



V 2 


Vi, V 2 , V3, V4 : arbitrary V z closes the polygon 

V5 closes the polygon 

The vector product of A and B is an axial vector, perpendicular to the plane containing A and B. 

z 



AxB=ABsinan=-BxA (1.41) 

1 .1 1 Linear Operators and Matrices 

Transformations from a coordinate system [x, y] to another system [x', y], without shift of the 
origin, or from a point P [x, y] to another point P' [x', y ], in the same system, that have the form 



x' = ax+by 
y' = cx+dy 
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where a, b, c, d are real coefficients, can be written in matrix notation, as follows 



Symbolically, 

x' = Mx, (1.42) 

where 

x = [x, y], and x' = [x', y], both column 2-vectors, 
and 



a 2 x 2 matrix operator that “changes” [x, y] into [x', y]. 

In general, M transforms a unit square into a parallelogram: 



This transformation plays a key role in Einstein’s Special Theory of Relativity (see later discussion). 
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1.12 Rotation operators 

Consider the rotation of an x, y coordinate system about the origin through an angle 0: 



From the diagram, we see that 
x'= xcostj) + ysincj) 


y' = — xsincj) + ycoscj) 


r - > s 

x' 


r 

COS^ 

sincj) 


r ~s 

X 

y' 

V J 


-sind) 

cos4> 

j 


JO 


Symbolically, 

P'=9ftd(<t>)P 


x 


( 1 . 43 ) 


where 
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cos4> sir*)) 

mjS = is the rotation operator. 

-sincj) cos4> 

The subscript c denotes a rotation of the coordinates through an angle +c|) . 

The inverse operator, is obtained by reversing the angle of rotation: +c|>^--(|). 
We see that matrix product 

where the superscript T indicates the transpose (rows o columns), and 

r 

1 0 

1= is the identity operator. 

0 1 

v. J 

Eq.(1 .44) is the defining property of an orthogonal matrix. 

If we leave the axes fixed and rotate the point P[x, y] to P'[x', y], then 

we have 

y 
y' 

y 

0 x' x x 

From the diagram, we see that 

x' = xcos(|) - ysincp, and y' = xsincp + ycoscp 



(1.44) 


(1.45) 
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or 

P'=mv{ct))P (1.46) 

where r 

cos4> — sin4> 

di^) = , the operator that rotates a vector through -Kp. 

sincp cos4> 

V J 

1.13 Components of a vector under coordinate rotations 

Consider a vector V [v x , v y ], and the same vector V' with components [v x ,vy], in a 
coordinate system (primed), rotated through an angle -Kp. 

y' 


0, 0' v x x 

We have met the transformation [x, y] -> [x', y] under the operation here, we have the 
same transformation but now it operates on the components of the vector, v x and v y , 

[Vx , Vy] = 3ftd((p)[v x , Vy]. (1.47) 

PROBLEMS 

1-1 i) If u = 3 x'V show that du/3x = (3 ^In3)/y and du/3y = (-3 Yj h\n3)lf. 
ii) If u = ln{(x 3 + yj/x 2 } show that du/3x = (x 3 - 2y)/(x(x 3 +y)) and <9u/<9y = 1/(x 3 + y). 
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1-2 Calculate the second partial derivatives of 

f(x, y) = (1/Vy)exp{-(x - a) 2 /4y}, a = constant. 

1 -3 Check the answers obtained in problem 1-2 by showing that the function f(x, y) in 
1-2 is a solution of the partial differential equation dT/^-df/dy = 0. 

1 4 If f(x, y, z) = 1/(x 2 + y 2 + z 2 ) 1/2 = 1/r, show that f(x, y, z) = 1/r is a solution of Laplace’s 
equation 

ST/Sx 2 + ST/dy 2 + cffldz 2 = 0. 

This important equation occurs in many branches of Physics. 

1-5 At a given instant, the radius of a cylinder is r(t) = 4cm and its height is h(t) = 10cm. 

If r(t) and h(t) are both changing ata rate of 2 cm.s- 1 , show that the instantaneous 
increase in the volume of the cylinder is 192 tt cnf.s- 1 . 

1-6 The transformation between Cartesian coordinates [x, y, z] and spherical polar 
coordinates [r, 0, c()] is 

x = rsinGcoscj), y = rsinOsincj), z = rcosG. 

Show, by calculating all necessary partial derivatives, that the square of the line 
element is 

ds 2 = dr 2 + r^inTJdcl) 2 + rW. 

Obtain this result using geometrical arguments. This form of the square of the line element will be 
used on several occasions in the future. 

1 -7 Prove that the inverse of each element of a group is unique. 
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1 -8 Prove that the set of positive rational numbers does not form a group under division. 

1-9 A finite group of order n has n 2 products that may be written in an nxn array, called the group 
multiplication table. For example, the 4 th-roots of unity {e, a, b, c} = {± 1 , ±i}, where i = V- 1 , forms a 
group under multiplication (1 i = i, i(— i) = 1 , i 2 = — 1 , (-i) 2 = -1 , etc. ) with a multiplication table 

e = 1 a = i b=-1 c=-i 
el i -1 -i 

a i -1 -i 1 

b -1 -i 1 i 

c -i 1 i -1 

In this case, the table is symmetric about the main diagonal; this is a characteristic feature of a group in 
which all products commute (ab = ba) — it is an Abelian group. 

If G is the dihedral group D3, discussed in the text, where G = {e, a, a 2 , b, c, d}, where e is the 
identity, obtain the group multiplication table. Is it an Abelian group?. Notice that the three elements {e, 
a, a 2 } form a subgroup of G, whereas the three elements {b, c, d} do not; there is no identity in this 
subset. 

The group D3 has the same multiplication table as the group of permutations of three objects. 
This is the condition that signifies group isomorphism. 

1-10 Are the sets 

i) {[0,1,1], [1,0,1], [1,1,0]} 


and 
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ii) {[1,3, 5, 7], [4 -3, 2,1], [2,1, 4, 5]} 
linearly dependent? Explain. 

1-11 i) Prove that the vectors [0, 1 , 1 ], [1 , 0, 1 ], [1 , 1 , 0] form a basis for Euclidean space 
R 3 . 

ii) Do the vectors [1 , i] and [i, -1 ], (i = V-1 ), form a basis for the complex space C 2 ? 

1 -1 2 Interpret the linear independence of two 3-vectors geometrically. 

1-13 i) If X =[1,2, 3] and Y = [3, 2, 1 ], prove that their cross product is orthogonal to theX-Y plane. 

ii) IfX and Y are 3-vectors, prove thatXxY = 0 iff X and Y are linearly dependent. 

1-14 If 

r 

an ai 2 ai3 

T = a2i a22 a23 

0 0 1 

represents a linear transformation of the plane under which distance is an invariant, 
show that the following relations must hold : 


an 2 + a 2 i 2 - 312 2 + a 22 2 - 1 , and anai 2 + 821822 - 0 . 
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2 

KINEMATICS: THE GEOMETRY OF MOTION 

2.1 Velocity and acceleration 

The most important concepts in Kinematics— a subject in which the properties of the forces 

responsible for the motion are ignored — can be introduced by studying the simplest of all motions, 

namely that of a point P moving in a straight line. 

Let a point P [t, x] be at a distance x from a fixed point 0 at a time t, and let it be at a point 

P' ft', x] = P'[ t + At x + Ax] at a time At later. The average speed of P in the interval At is 

<v p > = Ax/At. (2.1) 

If the ratio Ax/At is not constant in time, we define the instantaneous speed of P at time t as the limiting 
value of the ratio as At -»• 0: 

v p = v p (t) = limit as At -» 0 of Ax/At = dx/dt = x = v x . 

Tie instantaneous speed is the magnitude of a vector called the instantaneous velocity of P: 

v = dx/dt , a quantity that has both magnitude and direction. (2.2) 

A space-time curve is obtained by plotting the positions of P as a function oft: 
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Tine tangent of the angle made by the tangent to the curve at any point gives the value of the 
instantaneous speed at the point. 

The instantaneous acceleration, a , of the point P is given by the time rateof-change of the velocity 

•• 

a = dv/dt = d(dx/dt)/dt = d 2 x/dt 2 = x . (2.3) 

A change of variable from t to x gives 

a = dv/dt = dv(dx/dt)/dx = v(dv/dx). (2.4) 

This is a useful relation when dealing with problems in which the velocity is given as a function of the 
position. For example 



The gradient is dv/dx and tana = dv/dx, therefore 

NQ, the subnormal, = v(dv/dx) = ap, the acceleration of P. (2.5) 

The area under a curve of the speed as a function of time between the times ti and fe is 
[A] [tijq = 1 pm v(t)dt = J pig (dx/dt)dt =W]dx = (x 2 - xi) 


= distance traveled in the time t? - ti . 


( 2 . 6 ) 
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The solution of a kinematical problem is sometimes simplified by using a graphical method, for 
example: 

A point A moves along an x-axis with a constant speed va. Let it be at the origin 0 (x = 0) at 
time t = 0. It continues for a distance xa, at which point it decelerates at a constant rate, finally stopping 
at a distance X from 0 at time T. 

A second point B moves away from 0 in the +x-direction with constant acceleration. Let it 
begin its motion at t = 0. It continues to accelerate until it reaches a maximum speed vb™* at a time 
fet™* when at xb 1 ™ from 0. At xb™, it begins to decelerate at a constant rate, finally stopping at X at 
time T: To prove that the maximum speed of B during its motion is 

vb^x = va{1 - (x^/2X)}- 1 , a value that is independent of the time at which the 
maximum speed is reached. 

Tlie velocity-time curves of the points are 



fine areas under the curves give X = vaU + va(T - tx)/2 = vb f*T/2, so that 



vb™* = va(1 + (WT)), but vaT =2X-xa, therefore vb™* = va{1 - (XV2X )}- 1 1 f^™*). 

2.2 Differential equations of kinematics 

If the acceleration is a known function of time then the differential equation 
a(t) = dv/dt 

can be solved by performing the integrations (either analytically or numerically) 
ja(t)dt=jdv 

If a(t) is constant then the result is simply 

at + C = v, where C is a constant that is given by the initial conditions. 

Let v = u when t = 0 then C = u and we have 
at+u=v. 

This is the standard result for motion under constant acceleration. 

We can continue this approach by writing: 
v = dx/dt = u + at. 

Separating the variables, 
dx = udt + atdt. 

Integrating gives 

x=ut+(1/2)at 2 +C' (for constant a). 

Ifx = 0 when t = 0 then C' = 0, and 
x(t) = ut+(1/2)at 2 . 

Multiplying this equation throughout by 2a gives 
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2ax = 2aut + (at) 2 
= 2aut+(v-u)2 

and therefore, rearranging, we obtain 
v 2 = 2ax-2aut+2vu -u 2 
= 2ax+2u(v-at)-u 2 

= 2ax+u 2 . (2.11) 

In general, the acceleration is a given function of time or distance or velocity: 

1) lfa=f(t)then 

a = dv/dt =f(t), (2.12) 

dv=f(t)dt, 

therefore 

v = Jf(t)dt + C(a constant). 

This equation can be written 
v = dx/dt = F(t) + C, 

therefore 

dx=F(t)dt+Cdt. 

Integrating gives 

x(t)=jF(t)dt+Ct+C'. (2.13) 

Tine constants of integration can be determined if the velocity and the posiion are known at a given 


time. 
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2) If a = g(x) = v(dv/dx) then (2.14) 

vdv = g(x)dx. 

Integrating gives 

v 2 = 2jg(x)dx + D, 

therefore 

v 2 = G(x) + D 

so that 

v=(dx/dt) = ±V(G(x) + D). (2.15) 

Integrating this equation leads to 

±Jdx/{V(G(x) + D)} = t + D'. (2.16) 

Alternatively, if 

a = d 2 x/dt 2 = g(x) 

then, multiplying throughout by 2(dx/dt)gives 
2(dx/dt)(d2x/dt 2 ) = 2(dx/dt)g(x). 

Integrating then gives 

(dx/dt) 2 = 2jg(x)dx + D etc. 

As an example of this method, consider the equation of simple harmonic motion (see later discussion) 
d 2 x/dt 2 =-oa 2 x. (2.17) 

Multiply throughout by 2(dx/dt), then 
2(dx/dt)d 2 x/dt 2 =-2oa 2 x(dx/dt). 
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This can be integrated to give 

(dx/dt) 2 =-co 2 x 2 +D. 

If dx/dt = 0 when x = A then D = go 2 A 2 , therefore 
(dx/dt) 2 = go 2(A 2 - x 2 ) = v 2 , 

so that 

dx/dt =± goV(A 2 -x 2 ). 

Separating the variables, we obtain 

- dx/{V(A 2 - x 2 )} = codt (The minus sign is chosen because dx and dt have opposite signs). 
Integrating, gives 

cos- 1 (x/A) = oot+D'. 

But x = A when t = 0, therefore D' = 0, so that 

x(t) = Acos(oot), where A is the amplitude. (2.18) 

3) If a = h(v), then (2.19) 

dv/dt = h(v) 

therefore 

dv/h(v) = dt, 


Jdv/h(v)=t+B. (2.20) 

Some of the techniques used to solve ordinary differential equations are discussed in 


Appendix A. 
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2.3 Velocity in Cartesian and polar coordinates 

Tine transformation from Cartesian to Polar Coordinates is represented by the linear equations 
x = ncoscf) and y = rsincj), (2.21 a,b) 

or 

x=f(r,(|)) and y = g(r, 4>). 

The differentials are 

dx = (3f/3r)dr + (<9f/<9cj))dc|) and dy = (3g/dr)dr + (3g/3c|))d(|). 

We are interested in the transformation of the components of the velocity vector under 
[x, y] -» [r, <j)]. The velocity components involve the rates of change of dx and dy with respect to time: 
dx/dt = (®)dr/dt + (3f/3(|))d(|Vdt and dy/dt = (3g/3r)dr/dt + (3g/a|))dc|)/dt 


or 


x = (3f/3r)r + (3f/a|))c|) and y = (3g/3r)r + (3g/a|))(|). 


But, 


3f/3r = coscj), 3f/3(|) = -rsincj), 3g/3r = sincj), and dg/drj) = rcoscp, 


therefore, the velocity transformations are 
• • • 
x = coscj) r - sincj)(r cj>) = v x 


and 


y = sincj) r + cosc|>(r cp) = v y . 


( 2 . 22 ) 


(2.23) 

(2.24) 


These equations can be written 
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Changing 0 -> -<J), gives the inverse equations 



0 x 


The quantity dcjVdt is called the angular velocity of P about the origin 0. 

2.4 Acceleration in Cartesian and polar coordinates 

We have found that the velocity components transform from [x, y] to [r, (|>] coordinates as 

follows 


v x = coscj) r — sincj)(r 4>) = x 



and 


v y = sincj) r + cx>s4>(r c|)) = y. 
The acceleration components are given by 
a x = dv>/dt and v y =dv/dt 

We therefore have 

• • 

a x = (d/dtXcoscp r-sin4>(r c|))} 



0 
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( 2 . 26 ) 


( 2 . 27 ) 


( 2 . 28 ) 


x 
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These expressions for the components of acceleration will be of key importance in discussions of 
Newton’s Theory of Gravitation. 

We note that, if r is constant, and the angular velocity go is constant then 


= r (p = rco = 0, 

(2.29) 

& 

i 

ii 
1 

II 

1 

II 

%- 

• t 

1 

II 

* 

(2.30) 

• 

v^= r( t)=rco. 

(2.31) 


These equations are true for c/rcu/ar motion. 


PROBLEMS 

2-1 A point moves with constant acceleration, a, along the x-axis. If it moves distances Axi 
and AX 2 in successive intervals of time Ati and At, prove that the acceleration is 
a = 2 (v 2 -vi)T 

where vi = Axi/Ati, V 2 = AdAk and T = Ati + At. 

2-2 A point moves along the x-axis with an instantaneous deceleration (negative 
acceleration): 

a(t) oc -v^t) 

where v(t) is the instantaneous speed at time t, and n is a positive integer. If the 
initial speed of the point is u (at t = 0), show that 

knt = {(u n - v n )/(uv) n }/n, where kn is a constant of proportionality, 
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and that the distance travelled, x(t), by the point from its initial position is 
knx(t) = {(if- 1 - ^/(uv)"- 1 }/^ - 1 ). 

2-3 A point moves along the x-axis with an instantaneous deceleration kv^t), where v(t) is 
the speed and k is a constant. Show that 
v(t) = u/(1 + kux(t)) 

where x(t) is the distance travelled, and u is the initial speed of the point. 

24 A point moves along the x-axis with an instantaneous acceleration 

d 2 x/dt 2 = — oc^/x 2 

where go is a constant. If the point starts from rest at x = a, show that the speed of 
the particle is 

dx/dt=-co{2(a-x)/(axp. 

Why is the negative square root chosen? 

2-5 A point P moves with constant speed v along the x-axis of a Cartesian system, and a 
point Q moves with constant speed u along the yaxis. At time t = 0, P is at x = 0, and 
Q, moving towards the origin, is at y = D. Show that the minimum distance, d™, 
between P and Q during their motion is 
dmh = D{1/(1 + (ll/v) 2 )}^. 

Solve this problem in two ways:1 ) by direct minimization of a function, and 2) by a 
geometrical method that depends on the choice of a more suitable frame of reference 
(for example, the rest frame of P). 



54 


2-6 Two ships are sailing with constant velocities u and v on straight courses that are 
inclined at an angle 0. If, at a given instant, their distances from the point of 
intersection of their courses are a and b, find their minimum distance apart. 

2-7 A point moves along the x-axis with an acceleration a(t) = kt 2 , where t is the time the 
point has been in motion, and k is a constant. If the initial speed of the point is u, 
show that the distance travelled in time t is 
x(t) = ut+(1/12)kt 4 . 

2-8 A point, moving along the x-axis, travels a distance x(t) given by the equation 
x(t) = aexp{kt} + bexp{-kt} 

where a, b, and k are constants. Prove that the acceleration of the point is 
proportional to the distance travelled. 

2-9 A point moves in the plane with the equations of motion 


r 

d 2 x/dt 2 


r 

-2 

1 


r ^ 

X 

d2y/dt 2 

V_ V 


1 

V 

-2 

j 


v/j 


Let the following coordinate transformation be made 
u = (x+y)/2andv=(x-y)/2. 

Show that in the u-v frame, the equations of motion have a simple form, and that the 
time-dependence of the coordinates is given by 
u = Acost + Bsint, 


and 



v = CcosV3 1 + DsinV3 t, where A, B, C, D are constants. 
This coordinate transfoimation has “diagonalized” the original matrix: 
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The matrix with zeros everywhere, except along the main diagonal, has the 
interesting property that it simply scales the vectors on which it acts — it does not 
rotate them. Tie scaling values are given by the diagonal elements, called the 
eigenvalues of the diagonal matrix. Tie scaled vectors are called eigenvectors. A 
small industry exists that is devoted to finding optimum ways of diagonalizing large 
matrices. Illustrate the motion of the system in the x-y frame and in the u-v frame. 
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3 

CLASSICAL AND SPECIAL RELATIVfTY 


3.1 The Galilean transformation 

Events belong to the physical world — they are not abstractions. We shall, nonetheless, 
introduce the idea of an ideal event that has neither extension nor duration. Ideal events may be 
represented as points in a space-time geometry. An event is described by a four-vector E[t, x, y, z] 
where t is the time, and x, y, z are the spatial coordinates, referred to arbitrarily chosen origins. 

Let an event E[t, x], recorded by an observer 0 at the origin of an x-axis, be recorded as the 
event E'[t', x] by a second observer O', moving at constant speed V along the x-axis. We suppose 
that their clocks are synchronized at t = t' = 0 when they coincide at a common origin, x = x' = 0. 

At time t, we write the plausible equations 
t'=t 
and 


x'=x-Vt, 

where Vt is the distance travelled by O' in a time t. These equations can be written 

E'=GE 


where 



( 3 . 1 ) 


G is the operator of the Galilean transformation. 



Tine inverse equations are 
t=t' 
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and 


or 


x =x'+Vt' 


E=G- 1 E' 


( 3 . 2 ) 


where G - 1 is the inverse Galilean operator. (It undoes the effect ofG). 

If we multiply t and t' by the constants k and k', respectively, where k and k'have dimensions 
of velocity then all terms have dimensions of length. 

In space-space, we have the Pythagorean form x 2 + y 2 = r 2 (an invariant under rotations). We 
are therefore led to ask the question: is (kt f + x 2 an invariant under G in space-time? Direct calculation 
gives 

(kt)2+x 2 = (k'ty + x' 2 + 2Vx't' + W 2 
= (k'ty+x' 2 only if V = 0 ! 

We see, therefore, that Galilean space-time does not leave the sum of squares invariant. We note, 
however, the key role played by acceleration in Galilean-Newtonian physics: 

fine velocities of the events according to 0 and O' are obtained by differentiating 
x' —— Vt + x with respect to time, giving 


v'=-V+v, 


( 3 . 3 ) 
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a result that agrees with everyday observations. 

Differentiating v' with respect to time gives 

dv7dt-a'=dv/dt=a (3.4) 

where a and a'are the accelerations in the two frames of reference. The classical acceleration is an 
invariant under the Galilean transformation. If the relationship v - v - V is used to describe the 

motion of a pulse of light, moving in empty space at v=c=3x10 8 m/s, it does not fit the facts. For 
example, if V is 0.5c, we expect to obtain v' = 0.5c, whereas, it is found that v' = c. Indeed, in all cases 
studied, v' = c for all values of V. 

3.2 Einstein’s space-time symmetry: the Lorentz transformation 

It was Einstein, above all others , who advanced our understanding of the nature of space- 
time and relative motion. He made use of a symmetry argument to find the changes that must be 
made to the Galilean transformation if it is to account for the relative motion of rapidly moving objects 
and of beams of light. Einstein recognized an inconsistency in the Galilean-Newtonian equations, 
based as they are, on everyday experience. The discussion will be limited to non-accelerating, or so 
called inertial, frames 

We have seen that the classical equations relating the events E and E' are E' = GE, and 
the inverse E = G^E' where 



r 


r 


1 0 


1 0 

G = 

-V 1 

^ j 

and G- 1 = 

V 1 

^ J 
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These equations are connected by the substitution V <-> -V; this is an algebraic statement of the 
Newtonian principle of relativity. Einstein incorporated this principle in his theory. He also retained the 
linearity of the classical equations in the absence of any evidence to the contrary. (Equispaced 
intervals of time and distance in one inertial frame remain equispaced in any other inertial frame). He 
symmetrized the space-time equations as follows: 


f ^ 

t' 


' 1 -V 


r k 

t 

x' 


-V 1 


X 

k j 


k k 


k J 


(3.5) 


Note, however, the inconsistency in the dimensions of the time-equation that has now been introduced: 


f = t-Vx. 


The term Vx has dimensions of [Lp/[T], and not [T]. This can be corrected by introducing the invariant 
speed of light, c — a postulate in Einstein's theory that is consistent with the result of the Michelson- 
Moriey experiment: 

ct' = ct— Vx/c 

so that all terms now have dimensions of length. 

Einstein went further, and introduced a dimensionless quantity y instead of the scaling factor of 
unity that appears in the Galilean equations of space-time. This factor must be consistent with all 
observations. Tie equations then become 
ct- yet- p>yx 
x'=-|3yct+ yx, where |3=V/c. 


Tese can be written 
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E =LE, (3.6) 

where 

r "H 

Y -P>Y 

L = 

-Py y 

and E = [ct,x], 

L is the operator of the Lorentz transformation. 

The inverse equation is 

E = L-F (3.7) 


where 


L _1 = 


Y PY 
v Py y y 


This is the inverse Lorentz transformation, obtained from L by changing p -> -p (V -> -V); it has the 
effect of undoing the transformation L. We can therefore write 


LL 1 = I 


(3.8) 


Carrying out the matrix multiplications, and equating elements gives 


Y 2 -P 2 y 2 =1 


therefore, 


y = W(1 ~ P 2 ) (taking the positive root). (3.9) 

As V -» 0, |3 -> 0 and therefore y -> 1; this represents the classical limit in which the Galilean 
transformation is, for all practical purposes, valid. In particular, time and space intervals havethe same 
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measured values in all Galilean frames of reference, and acceleration is the single fundamental 
invariant. 

3.3 The invariant interval: contravariant and covariant vectors 

Previously, it was shown that the space-time of Galileo and Newton is not Pythagorean under 
G. We now ask the question: is Einsteinian space-time Pythagorean under L ? Direct calculation 
leads to 

(ct) 2 + x 2 = y 2 (1 + |3 2 )(ct') 2 + 4(3y 2 x'ct' 

+y 2 (1+p> 2 )x' 2 

^(ct')2 + x' 2 if(3>0. 

Note, however, that the difference of squares is an invariant. 

(ct) 2 — x 2 = (ct) 2 — x 2 (3.10) 

because 

Y 2 (1 - P 2 ) = 1 ■ 

Space-time is said to be pseudo-Eudidean. The negative sign that characterizes Lorentz invariance 
can be induded in the theory in a general way as follows. 

We introduce two kinds of 4-vectors 

w = [x°, x 1 , x 2 , x 3 ], a contravariant vector, (3.11) 


and 


X|j = [xo, xi, X 2 , X 3 ], a covariant vector, where 
X|j = [x°, -X 1 , -X 2 , -X 3 ]. 


(3.12) 
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Tine scalar (or inner) product of the vectors is defined as 

#x M =(x°, x 1 , x 2 , x 3 )[x°, -x 1 , -x 2 , -x 3 ], to conform to matrix multiplication 

t t 

row column 

^-((x^+lxf+lx 3 ) 2 ). (3.13) 

Tine superscript! is usually omitted in writing the invariant; it is implied in the form 
The event 4-vector is 

= [ct, x, y, z] and the covariant form is 
Ep = [ct, — x, — y, — z] 


so that the invariant scalar product is 

Er j E| J = (ct) 2 -(x 2 + y 2 + z 2 ). (3.14) 

A general Lorentz 4-vector x^ transforms as follows: 

yy = Lw (3.15) 


where 


L = 


r 

y — (3y 0 0 

— (3y y 0 0 

0 0 10 

0 0 0 1 


-\ 


This is the operator of the Lorentz transformation if the motion of O' is along the x-axis of O’s frame of 
reference, and the initial times are synchronized (t = t' = 0 at x = x' = 0). 

Two important consequences of the Lorentz transformation, discussed in 3.5, are that intervals 
of time measured in two different inertial frames are not the same; they are related by the equation 


At'=yAt 


(3.16) 
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where At is an interval measured on a dock at rest in O's frame, and distances are given by 

AT = Al/y (3.17) 

where Al is a length measured on a ruler at rest in O's frame. 

3.4 The group structure of Lorentz transformations 

The square of the invariant interval s, between the origin [0, 0, 0, 0] and an arbitrary event t = 
[x°, x 1 , x 2 , x 3 ] is, in index notation 

s 2 = x ! \ = x' ! 'x' [( , (sum over u=0, 1,2, 3). (3.18) 

The lower indices can be raised using the metric tensorr^ = diag(1 , -1 , -1 , -1 ), so that 

s 2 = r| m x ! 'x v = i'|, a x' ! 'x' v , (sum over u and v). (3.19) 

The vectors now have contravariant forms. 

In matrix notation, the invariant is 

s? = x T rix=x T rix'. (3.20) 

(The transpose must be written explicitly). 

The primed and unprimed column matrices (contravariant vectors) are related by the Lorentz matrix 
operator, L 

x' = Lx. 

We therefore have 

x t tix = (Lx) t ti(Lx) 

=x T L T iqLx. 


The x’s are arbitrary, therefore 
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L T riL=T]. (3.21) 

This is the defining property of the Lorentz transformations. 

The set of all Lorentz transformations is the set L of all 4 x 4 matrices that satisfies the defining 

property 

L = {L: L T r) L = rj ; L all 4 x 4 real matrices; r\ = diag(1 ,-1,-1, -1}. 

(Note that each L has 16 (independent) real matrix elements, and therefore belongs to the 16- 
dimensional space, R 16 ). 

Consider the result of two successive Lorentz transformations Li and bthat transform a 4- 
vectorx as follows 

x— > x — > X 

where 

x = Lix , 
and 

x = bx'. 

The resultant vectorx" is given by 
x" = b(Lix) 

= bbx 
= Ux 

where 


b= bb (Li followed by L 2 ). 


(3.22) 
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If the combined operation L is always a Lorentz transformation then it must satisfy 
L t tiLc=ti . 

We must therefore have 

(bLi) T Ti(U_i) =r\ 


or 

Li T (b T rib)Li =r\ 

so that 

Li T r) Li = ri , (Li.UEL) 

therefore 

L=ULiGL. (3.23) 

Any number of successive Lorentz transformations may be carried out to give a resultant that is itself a 
Lorentz transformation. 

If we take the determinant of the defining equation of L, 
det(L T r| L) = detr) 

we obtain 

(detL) 2 = 1 (detL=detL T ) 

so that 


del=±1. (3.24) 

Since the determinant of L is not zero, an inverse transformation L- 1 exists, and the equation L-l = I, 
the identity, is always valid. 
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Consider the inverse of the defining equation 
(L T r| L) _1 = T| _1 , 
or 

L -1 r 1 - 1 (L T ) _1 =ri _1 . 

Using r\=r\ \ and rearranging, gives 

L- 1 r 1 (L- 1 ) T =ri . (3.25) 

This result shows that the inverse L.- 1 is always a member of the set L. 

Tine Lorentz transformations L are matrices, and therefore they obey the associative property 
under matrix multiplication. 

We therefore see that 

1. If Li and be L , then b Li e L 

2. lfLGL,thenL- 1 GL 

3. The identity I = diag(1 , 1, 1, 1 )gL 
and 

4. The matrix operators L obey associativity. 

Tine set of all Lorentz transformations therefore forms a group. 

3.5 The rotation group 

Spatial rotations in two and three dimensions are Lorentz transformations in which the time- 
component remains unchanged. In Chapter 1 , the geometrical properties of the rotation operators are 
discussed. In this section, we shall consider the algebraic structure of the operators. 
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Let 0ft be a real 3x3 matrix that is part of a Lorentz transformation with a constant 
time-component, 

f i ootn 

0 (3.26) 

L = 0 9ft 

0 

% J 

In this case, the defining property of the Lorentz transformations leads to 

'100010001000 1000 
0 0-1 0 0 0 0-1 0 0 

0 01 T 0 0-1 0 0 0t = 0 0-1 0 (3.27) 

0 0 0 0-1 0 0 0 0-1 

V J V J V. J V J 

so that 

DTO = I , the identity matrix, diag( 1 , 1 , 1 ). 

This is the defining property of a threedimensional orthogonal matrix. (The related two -dimensional 
case is treated in Chapter 1). 

If x = [xi, X 2 , X 3 ] is a three-vector that is transformed under 01 to give x' then 

x T x' = x T 9rax = x T x = xi 2 + X 2 2 + X 3 2 = invariant under 01 . (3.28) 

The action of 01 on any three-vector preserves length. The set of all 3x3 orthogonal matrices is 
denoted by 0(3), 

0(3) = {01 : 0t T 0l = I, rj G Reals}. 

The elements of this set satisfy the four group axioms. 
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3.6 The relativity of simultaneity: time dilation and length contraction 

In order to record the time and place of a sequence of events in a particular inertial reference 
frame, it is necessary to introduce an infinite set of adjacent “observers”, located throughout the entire 
space. Each observer, at a known, fixed position in the reference frame, carries a clock to record the 
time and the characteristic property of every event in his immediate neighborhood. The observers are 
not concerned with non-local events. The clocks carried by the observers are synchronized — they all 
read the same time throughout the reference frame. The process of synchronization is discussed later. 
It is the job of the chief observer to collect the information concerning the time, place, and characteristic 
feature of the events recorded by all observers, and to construct the world line (a path in space-time), 
associated with a particular characteristic feature (the type of particle, for example). 

Consider two sources of light 1 and 2, and a point M midway between them. Let Ei denote 
the event “flash of light leaves 1”, and Ez denote the event “flash of light leaves 7. The events Ei and 
E 2 are simultaneous if the flashes of light from 1 and 2 reach M at the same time. The fact that the 
speed of light in free space is independent of the speed of the source means that simultaneity is 
relative. 

The clocks of all the observers in a reference frame are synchronized by correcting them for 
the speed of light as follows: 

Consider a set of clocks located at xo, xi, X2, X3, ... along the x-axis of a reference frame. Let xo 
be the chiefs dock, and let a flash of light be sent from the dock at xo when it is reading to (12 noon, 
say). At the instant that the light signal reaches the dock at xi, it is set to read to + (xi/c), at the instant 
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that the light signal reaches the dock at X2, it is set to read to + (xafc) , and so on for every dock along the 
x-axis. All docks in the reference frame then “read the same time”— they are synchronized. From the 
viewpoint of all other inertial observers, in their own reference frames, the set of docks, sychronized 
using the above procedure, appears to be unsychronized. It is the lack of symmetry in the 
sychronization of docks in different reference frames that leads to two non-intuitive results namely, 
length contraction and time dilation. 

Length contraction: an application of the Loientz transformation. 

Consider a rigid rod at rest on the x-axis of an inertial reference frame S'. Because it is at rest it does 
not matter when its end-points xi ' and are measured to give the rest-, or proper-length of the rod, Lo' 

= X2 — Xl . 

Consider the same rod observed in an inertial reference frame S that is moving with constant velodty- 
V with its x-axis parallel to the x'-axis. We wish to determine the length of the moving rod; we require 
the length L = X 2 - xi according to the observers in S. This means that the observers in S must 
measure xi and X 2 at the same time in their reference frame. Tine events in the two reference frames 
S, and S' are related by the spatial part of the Lorentz transformation: 
x'=-(3yct+YX 

and therefore 

X 2 ' - xT = — (3yc(t2 - ti) + y(x 2 - xi). 

where 


(3 = V/c and y = 1/V(1 — (3 2 ). 
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Since we require the length (& - xi) in S to be measured at the same time in S, we must have fe - ti = 
0, and therefore 

U' = X2'-Xl' = Y(X2-Xl), 

or 

Lo'(at rest) = yL (moving). (3.29) 

The length of a moving rod, L, is therefore less than the length of the same rod measured at rest, Lo 
becausey>1. 

Time dilation 

Consider a dock at rest at the origin of an inertial frame S', and a set of synchronized docks at 
xo, xi, X 2 , ... on the x-axis of another inertial frame S. Let S' move at constant speed V relative to S, 
along the common x -, x'- axis. Let the docks at Xo, and Xo' be sychronized to read fc , and to' at the 
instant that they coincide in space. A proper time interval is defined to be the time between two events 
measured in an inertial frame in which the two events occur at the same place. The time part of the 
Lorentz transformation can be used to relate an interval of time measured on the single clock in the S' 
frame, and the same interval of time measured on the set of synchronized docks at rest in the S frame. 
We have 

ct=yct' + (3yx' 
or 

c(fe - ti ) = yctfc - ti ') + P>y(x 2 ' - xi '). 

There is no separation between a single dock and itself, therefore - xi ' = 0, so that 
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c(fe - tiXmoving) = ycfc' - ti ')(at rest) (y > 1 ). (3.30) 

A moving clock runs more slowly than a dock at rest. 

In Chapter 1, it was shown that the general 2 x2 matrix operator transforms rectangular coordinates 
into oblique coordinates. Tine Lorentz transformation is a spedal case of the 2x2 matrices, and 
therefore its effect is to transform rectangular space-time coordinates into oblique space-time 
coordinates: 



The geometrical form of the Lorentz transformation 
The symmetry of space-time means that the transformed axes rotate through equal angles, 
tan- 1 ) 3 ). The relativity of simultaneity is dearly exhibited on this diagram: two events that occur at the 
same time in the ct, x -frame necessarily occur at different times in the oblique ct', x -frame. 

3.7 The 4-velocity 

A differential time interval, dt, cannot be used in a Lorentz-invariant way in kinematics. We 
must use the proper time differential interval, dr, defined by 
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(ccf-dx 2 = (cdty-dx' 2 = (ode) 2 . (3.31) 

The Newtonian 3-velocity is 

vn = [dx/dt, dy/dt, dz/dt], 
and this must be replaced by the 4-velocity 

Vm = [d(ct)/di, dx/di, dy/di, dz/di] 

= [d(ct)/dt, dx/dt, dy/dt, dz/dt](dt/dr) 

= E/c,yvn]. (3.32) 

The scalar product is then 

VmV m = (yc) 2 - (yvn ) 2 (the transpose is understood) 

=m-m) 

= &. (3.33) 


Tie magnitude of the 4-velodty is therefore | V^| = c, the invariant speed of light. 
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PROBLEMS 

3-1 Two points A and B move in the plane with constant velocities |va| = V2 m.s- 1 and |vb| = 2V2 m.s -1 . 
They move from their initial (t = 0) positions, A(0)[1 , 1] and B(0)[6, 2] as shown: 



Show that the closest distance between the points is |R| mn = 2.529882.. meters, 
and that it occurs 1 .40... seconds after they leave their initial positions. (Remember 
that all inertial frames are equivalent therefore choose the most appropriate for 
dealing with this problem). 

3-2 Show that the set of all standard (motion along the common x-axis) Galilean 
transformations forms a group. 

3-3 A flash of light is sent out from a point xi on the x-axis of an inertial frame S, and it is 
received at a point X 2 = xi + 1. Consider another inertial frame, S', moving with 
constant speed V = (3c along the x-axis; show that, in S': 
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i) the separation between the point of emission and the point of reception of the light 

is l'=l{(1-|3)/(1+|3)P 

ii) the time interval between the emission and reception of the light is 

At' = M1-P)/(1+pr 

34 The distance between two photons of light that travel along the x-axis of an inertial 
frame, S, is always I. Show that, in a second inertial frame, S', moving at constant 
speed V = |3c along the x-axis, the separation between the two phot ons is 
Ax'=l{(1+|3)/(1-|3F. 

3-5 An event [ct, x] in an inertial frame, S, is transformed under a standard Lorentz 
transformation to [ct', x'] in a standard primed frame, S', that has a constant speed V 
along the x-axis, show that the velocity components of the point x, x' are related by 
the equation 

Vx = (Vx' + V)/(1 + (Vx'V/C 2 )). 

3-6 An object called a K°-meson decays when at rest into two objects called Ti-mesons 
(tt±), each with a speed of 0.8c. If the K°-meson has a measured speed of 0.9c when it 
decays, show that the greatest speed of one of then-mesons is (85/86)c and that its 


least speed is (5/1 4)c. 
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4 

NEWTONIAN DYNAMICS 

Although our discussion of the geometry of motion has led to major advances in our 
understanding of measurements of space and time in different inertial systems, we have yet to come to 
the crux of the matter, namely — a discussion of the effects of forces on the motion of two or more 
interacting particles. This key branch of Physics is called Dynamics. It was founded by Galileo and 
Newton and perfected by their followers, most notably Lagrange and Hamilton. We shall see that the 
Newtonian concepts of momentum and kinetic energy require fundamental revisions in the light of the 
Einstein’s Special Theory of Relativity. The revised concepts come about as a result of Einstein's 
recognition of the crucial role of the Principle of Relativity in unifying the d/namics of all mechanical and 
optical phenomena. In spite of the conceptual difficulties inherent in the classical concepts, (difficulties 
that will be discussed later), the subject of Newtonian dynamics represents one of the great triumphs of 
Natural Philosophy. The successes of the classical theory range from accurate descriptions of the 
dynamics of everyday objects to a detailed understanding of the motions of galaxies. 

4.1 The law of inertia 

Galileo (1544-1 642) was the first to develop a quantitative approach to the study of motion. He 
addressed the question — what property of motion is related to force? Is it the position of the moving 
object? Is it the velocity of the moving object? Is it the rate of change of its velocity? ...The answer to 
the question can be obtained only from observations; this is a basic feature of Physics that sets it apart 
from Philosophy proper. Galileo observed that force influences the changes in velocity (accelerations) 



76 


of an object and that, in the absence of external forces (e.g: friction), no force is needed to keep an 
object in motion that is travelling in a straight line with constant speed. This observationally based law is 
called the Law of Inertia. It is, perhaps, difficult for us to appreciate the impact of Galileo's new ideas 
concerning motion. The fact that an object resting on a horizontal surface remains at rest unless 
something we call force is applied to change its state of rest was, of course, well-known before Galileo's 
time. However, the fact that the object continues to move after the force ceases to be applied caused 
considerable conceptual difficulties for the early Philosophers (see Feynman The Character of Physical 
Law}, fine observation that, in practice, an object comes to rest due to fricticnal forces and air 
resistance was recognized by Galileo to be a side effect, and not germane to the fundamental question 
of motion. Aristotle, for example, believed that the true or natural state of motion is one of rest. It is 
instructive to consider Aristotle's conjecture from the viewpoint of the Principle of Relativity — is a 
natural state of rest consistent with this general Principle? According to the general Principle of 
Relativity, the laws of motion have the same form in all frames of reference that move with constant 
speed in straight lines with respect to each other. An observer in a reference frame moving with 
constant speed in a straight line with respect to the reference frame in which the object is at rest would 
conclude that the natural state or motion of the object is one of constant speed in a straight line, and not 
one of rest. All inertial observers, in an infinite number of frames of reference, would come to the same 
conclusion. We see, therefore, that Aristotle's conjecture is not consistent with this fundamental 
Principle. 
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42 Newton’s laws of motion 

During his early twenties, Newton postulated three Laws of Motion that form the basis of 
Classical Dynamics. He used them to solve a wide variety of problems including the dynamics of the 
planets. The Laws of Motion, first published in the Principia in 1687, play a fundamental role in 
Newton’s Theory of Gravitation (Chapter 7); they are: 

1 . In the absence of an applied force, an object will remain at rest or in its present state of constant 
speed in a straight line (Galileo's Law of Inertia) 

2. In the presence of an applied force, an object will be accelerated in the direction of the applied force 
and the product of its mass multiplied by its acceleration is equal to the force. 

and, 

3. If a body A exerts a force of magnitude |Fab| on a body B, then B exerts a force of equal magnitude 
|Fba| on A.. The forces act in opposite directions so that 

Fab=-Fba. 

In law number 2, the acceleration lasts only while the applied force lasts, fine applied force need not, 
however, be constant in time — the law is true at all times during the motion. Law number 3 applies to 
“contact’ interactions. If the bodies are separated, and the interaction takes a finite time to propagate 
between the bodies, the law must be modified to include the properties of the “field “ between the 
bodies, finis important point is discussed in Chapter 7. 

4.3 Systems of many interacting particles: conservation of linear and angular 


momentum 
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Studies of the dynamics of two or more interacting particles form the basis of a key part of 
Physics. We shall deduce two fundamental principles from the Laws of Motion; they are: 

1) The Conservation of Linear Momentum which states that, if there is a direction in which the sum of 
the components of the external forces acting on a system is zero, then the linear momentum of the 
system in that direction is constant and 

2) Tine Conservation of Angular Momentum which states that, if the sum of the moments of the 
external forces about any fixed axis (or origin) is zero, then the angular momertum about that axis (or 
origin) is constant. 

Tine new terms that appear in these statements will be defined later. 

The first of these principles will be deduced by considering the dynamics of two interacting 
particles of masses mi and m 2 wiith instantaneous coordinates [x, yi ] and [X2, yi respectively. In 
Chapter 12 , these principles will be deduced by considering the invariance of the Laws of Motion under 
translations and rotations of the coordinate systems. 

Let the external forces acting on the particles be Fi and F2 , and let the mutual interactions be 
F2T and F12'. The system is as shown 



Resolving the forces into their x- and y-components gives 
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X 

a) The equations of motion 

Tine equations of motion for each particle are 

1) Resolving in the x-direction 

Fxi + Fx 2 i'=mi(d 2 xi/dt 2 ) (4.1) 

and 

Fx 2 -Fxi 2 ' = m 2 (dW). (4.2) 

Adding these equations gives 

Fxi + Fx 2 + (F*T- Fxi 2 ') = mi(d 2 xi/dt 2 ) + m 2 (d 2 x^dt 2 ). (4.3) 

2) Resolving in the y-direction gives a similar equation, namely 

Fyl + Fy2 + (Fy2i' - Fyi 2 ') = mi^i/dt 2 ) + m 2 (d 2 y 2 / dt 2 ). (4.4) 

b) The role of Newton’s 3rd Law 

For instantaneous mutual interactions, Newton’s 3rd Law gives |F 2 i'| = |Fi 2 '| 

so that the x- and y-components of the internal forces are themselves equal and opposite, therefore tie 

total equations of motion are 


Fxi + Fx2 = mi(d 2 xi/dt 2 ) + m 2 (d 2 X2/dt 2 ) : 


(4.5) 
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and 


F y i + Fy 2 = mi(d 2 yi/dt 2 )+ rr^d^dt 2 ). 
c) The conservathn of linear momentum 

If the sum of the external forces acting on the masses in the x-direction is zero, then 
Fxl + Fx2 = 0 , 

in which case, 

0 = mi(d 2 xi/dt 2 ) + ni2(d 2 X2/dt 2 ) 


or 


(4.6) 


(4.7) 


0 = (d/dt)(miv x i) + (d/dt)(rri2Vx2), 
which, on integration gives 

constant = imi + rrw^ . (4.8) 

The product (mass x velocity) is the linear momentum. We therefore see that if there is no resultant 
external force in the x-direction, the linear momentum of the two particles in the x-direction is conserved. 
The above argument can be generalized so that we can state: the linear momentum of the two 
particles is constant in any direction in which there is no resultant external force. 

4.3.1 Interaction of n-particles 

flie analysis given in 4.3 can be carried out for an arbitrary number of particles, n, 
with masses mi, m 2 , ...m n and with instantaneous coordinates [xi, yi], [X 2 , y 2 ] ..[Xn, y n ]. "Tine 
mutual interactions cancel in pairs so that the equations of motion of the n-particles are, in 


the x-direction 



Fxi + Fx2 + ... F>n = mixi + m2X2 + ... mnXn = sum of the x-components of 


(4.9) 


the external forces acting on the masses, 

and, in the y-direction 

•• •• •• 

F y i + Fy2+... F^ = miyi + miy 2 + .. .m n y n = sum of the y-components of (4.1 0) 

the external forces acting on the masses. 

In this case, we see that if the sum of the components of the external forces acting on the 
system in a particular direction is zero, then the linear momentum of the system in that direction is 
constant. If, for example, the direction is the x-axis then 

miVxi + ni 2 Vx 2 + ... rrinVxn = constant. (4.1 1) 

4.3.2 Rotation of two interacting particles about a fixed point 

We begin the discussion of the second fundamental conservation law by considering the 
motion of two interacting particles that move under the influence of external forces Fi and F2, and 
mutual interactions (internal forces) F21' and F12'. We are interested in the motion of the two masses 
about a fixed point 0 that is chosen to be the origin of Cartesian coordinates. The perpendiculars 
drawn from the point 0 to the lines of action of the forces are Ri, R2, and R’. The system is illustrated in 
the following figure. 
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a) The moment of forces about a fixed origin 

Tine total moment of the forces about the origin 0 is defined as 
Ti2 = R1F1 + R2F2 + (R F12 - R F21) ( 4 . 12 ) 

t t 

moment of moment of 

external forces internal forces 

A positive moment acts in a counter-clockwise sense. 

Newton’s 3 rd Law gives 

IF21I = IF12I , 

therefore the moment of the internal forces about 0 is zero. (Their lines of action are the same). 

Tie total effective moment about 0 is therefore due to the external forces, alone. Writing the moment 
in terms of the x- and y-components of Fi and F2, we obtain 


ri 2 -xiFyi+ x2Fy2 - yi Fxi - y2Fx2 


( 4 . 13 ) 
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b) The conservation of angular momentum 

If the moment of the external forces about the origin 0 is zero then, by integration, we have 
constant = xipyi + x 2 p^ - yip x i - y 2 Px 2 . 
where pxi is the x-component of the momentum of mass 1 , etc.. 

Rearranging, gives 

constant = (xipyi -yipxi) + (x 2 p^-y 2 Px 2 ). (4.14) 

The right-hand side of this equation is called the angular momentum of the two particles about the fixed 
origin, 0. 

Alternatively, we can discuss the conservation of angular momentum using vector analysis. 
Consider a non-relativistic particle of mass m and momentum p, moving in the plane under the 
influence of an external force F about a fixed origin, 0: 



The angular momentum, L, of m about 0 can be written in vector form 

L=rxp. (4.15) 

The torque, T, associated with the external force F acting about 0 is 

r = rxF. (4.16) 


The rate of change of the angular momentum with time is 
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dL/dt = r x (dp/dt) + p x (dr/dt) (4.17) 

= rxm(dv/dt) + mvxv 
= rxF(becausevxv=0) 

=r. 

If there is no external torque, T = 0. We have, therefore 

r = dUdt=0, (4.18) 

so that L is a constant of the motion. 

4.3.3 Rotation of n-interacting particles about a fixed point 

The analysis given in 4.3.2 can be extended to a system of n-interacting particles. The 
moments of the mutual interactions about the origin 0 cancel in pairs (Newton’s 3rd Law) so that we 
are left with the moment of the external forces about 0. The equation for the total moment is therefore 

Fu n = I[Fi,n] (Xjd(miVyi)/dt - yd(miV>d)/dt). 

If the moment of the external forces about the fixed origin is zero then the total angular 
momentum of the system about 0 is a constant. This result follows directly by integrating the 
expression for ri, 2 , ...n=0. (4.19) 

If the origin moves with constant velocity, the angular momentum of the system, relative to the 
new coordinate system, is constant if the external torque is zero 
4.4 Work and energy in Newtonian dynamics 
4.4.1 The principle of work: kinetic energy and the work done by forces 

Consider a mass m moving along a path in the [x, y]-plane under the influence of a 
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resultant force F that is not necessarily constant. Let the components of the force be F x and F y when 
the mass is at the point P[x, y]. We wish to study the motion of m in moving from a point A[x\, yA] 
where the force is Fa to a point B[xb, yB] where the force is Fb. The equations of motion are 


m(cMlt 2 ) = F x 

(4.20) 

m(d 2 y/dt 2 ) = F y 

(4.21) 


Multiplying these equations by dx/dt and dy/dt, respectively, and adding, we obtain 
m(dx/dt)(d 2 x/dt 2 ) + m(dy/dt)(d2y/dt 2 ) = F x (dx/dt) + Fy(dy/dt). 

This equation now can be integrated with respect to t, so that 
m((dx/dt)2 + (dy/dt)2)/2 = j(Fxdx + Fydy) . 
or 

mv 2 /2=j(F x dx+F y dy), (4.22) 

where v = ((dx/dt) 2 + (dy/dt) 2 ) 12 is the speed of the particle at the point [x, y]. The term mW2 is called the 
classical kinetic energy of the mass m. It is important to note that the kinetic energy is a scalar. 

If the resultant forces acting on m are Fa at A[xa, y^ at time t\, and Fb at B[xb, yB] at time fe, then 

we have 

mvB 2 ^ - mvA 2 /2 = JjxAxBfxdx + JyA^Fydy . (4.23) 

The terms on the right-hand side of this equation represent the work done by the resultant forces acting 
on the particle in moving it from A to B. The equation is the mathematical form of the general Principle 



86 


of Work, the change in the kinetic energy of a system in any interval of time is equal to the work done 
by the resultant forces acting on the system during that interval. 

4.5 Potential energy 
4.5.1 General features 

Newtonian dynamics involves vector quantities — force, momentum, angular momentum, 
etc.. There is, however, another form of dynamics that involves scalar quantities; a form that originated 
in the works of Huygens and Leibniz, in the 17th century. The scalar form relies upon the concept of 
energy, in its broadest sense. We have met the concept of kinetic energy in the previous section. We 
now meet a more abstract quantity called potential energy. 

fine work done, W, by a force, F, in moving a mass m from a position g* to a position sb along 
a path sis, from section 4.3, 

W = jjsA sB] F-ds = the change in the kinetic energy during the motion, 

= J[sa,sb] Fdscosa, where a is the angle between F and ds. (4.24) 

If the force is constant, we can write 
W=F(sb-Sa), 

where sb - sa is the arc length. 

If the motion is along the x-axis, and F = F x is constant then 

W = F x (xb - xa), the force multiplied by the distance moved. (4.25) 

This equation can be rearranged, as follows 


mvxB 2 /2 — F xXb = mvxA 2 /2 - FxXa . 


(4.26) 
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This is a surprising result; the kinetic energy of the mass is not conserved during the motion whereas 
the quantity (mv^/2 - FxX) is conserved during the motion. This means that the change in the kinetic 
energy is exactly balanced by the change in the quantity FxX. 

Since the quantity mv 2 /2 has dimensions of energy, the quantity FxX must have dimensions of energy if 
the equation is to be dimensionally correct. The quantity— FxX is called the potential energy of the mass 
m, when at the position x, due to the influence of the force F* We shall denote the potential energy by 
V. The negative sign that appears in the definition of the potential energy will be discussed later when 
explicit reference is made to the nature of the force (for example, gravitational or electromagnetic). 

Tie energy equation can therefore be written 

Tb + V b = Ta+Va. (4.27) 

Tis is found to be a general result that holds in all cases in which a potential energy function 
can be found that depends only on the position of the object (or objects). 

4.5.2 Conservative forces 

Let F x and F y be the Cartesian components of the forces acting on a moving particle with 
coordinates [x, yj. Te work done Wi ^ by the forces while the particle moves from the position 
Pi [xi, yi] to another position X 2 , y^j is 

Wi^2 - J[xi,x2] Fxdx + J"[yi,y2] Fydy (4.28) 

= Jpl.pg (Fxdx + Fydy) . 


If the quantity Fxdx + Fydy is a perfect differential then a function U = f(x, y) exists such that 

F x = dll/3x and F y = dU/dy . (4.29) 
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Now, the total differential of the function U is 

dll = (dll/3x)dx + (dU/dy)dy (4.30) 

= Fxdx + Fydy. 

In this case, we can write 

JdU = /(Fxdx + F^dy) = U = f(x, y). 

The definite integral evaluated between Pi [xi, yi] and P 2 IX 2 , y&] is 

Jpi,P 2 i(F<dx + Fydy) =f(xa y 2 )-f(xi, y,) = U 2 -U 1 . (4.31) 

We see that in evaluating the work done by the forces during the motion, no mention is made of the 
actual path taken by the particle. If the forces are such that the function U(x, y) exists, then they are 
said to be conseivative. The function U(x, y) is called the force function. 

The above method of analysis can be applied to a system of many particles, n. The total work 
done by the resultant forces acting on the system in movrg the particles from their initial configuration, i, 
to their final configuration, f, is 

Wuf= Xi^i,n]lpki,Pk 2 |(FkxdXk+ Fkydyk), (4.32) 

= Uf — Ui, 

a scalar quantity that is independent of the paths taken by the individual particles. Pm [xki, ywi] and 
Pk 2 [Xk 2 , yk 2 ] are the initial and final coordinates of the kth-particle. 

The potential energy, V, of the system moving under the influence of conservative forces is 
defined in terms of the function U: V= - U . 


Examples of interactions that take place via conservative forces are: 
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1) gravitational interactions 

2) electromagnetic interactions 
and 

3) interactions between particles of a system that, for every pair of particles, act along the line 
joining their centers, and that depend in some way on their distance apart. These are the socalled 
central interactions. 

Frictional forces are examples of norvconservative forces. 

There are two other major methods of solving dynamical problems that differ in fundamental 
ways from the method of Newtonian dynamics; they are Lagrangian dynamics and Hamiltonian 
dynamics. We shall delay a discussion of these more general methods until our study of the Calculus 
of Variations in Chapters 

4.6 Particle interactions 
4.6.1 Elastic collisions 

Studies of the collisions among objects, first made in the 1 7th-century, led to the discovery of 
two basic laws of Nature: the conservation of linear momentum, and the conservation of kinetic energy 
associated with a special class of collisions called elastic collisions. 

The conservation of linear momentum in an isolated system forms the basis for a quantitative 
discussion of all problems that involve the interactions between particles. The present discussion will 
be limited to an analysis of the elastic collision between two particles. A typical two-body collision, in 
which an object of mass mi and momentum pi makes a grazing collision with another object of mass 
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m 2 and momentum p 2 (P 2 < pi), is shown in the following diagram. (Tine coordinates are chosen so 
that the vectors pi and p2 have the same directions). After the collision, the two objects move in 
directions characterized by the angles 0 and (|) with momenta pi ' and P2'. 

Before After 



If there are no external forces acting on the particles so that the changes in their states of 
motion come about as a result of their mutual interactions alone, the total linear momentum of the 
system is conserved. We therefore have 

pi + P2 = pi + P 2 (4.33) 

or, rearranging to give the momentum transfer, 

Pl-pi' = P2 — P2. 

fine kinetic energy of a particle, T is related to the square of its momentum 
(T = p2/2m); we therefore form the scalar product of the vector equation for the momentum transfer, to 
obtain 

pi 2 - 2pi • pi ' + pi ' 2 = P2' 2 - 2p 2 '- P2 + P2 2 . (4.34) 

Introducing the scattering angles 0 and cp, we have 
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Pi 2 - 2piPl 'COS0 + Pi' 2 = P 2 2 — 2p2P2COS(|) + P 2' 2 . 

This equation can be written 

pi ' 2 (x 2 - 2xcos0 + 1 ) = P 2 ,2 (y 2 - 2ycoscp + 1 ) (4.35) 

where 

x = pi/pi ' andy=p^p 2 '. 

If we choose a frame in which P 2 = 0 then y = 0 and we have 

x 2 - 2xcos0 + 1 = (P 2 VP 1 ') 2 . (4.36) 

If the collision is elastic, the kinetic energy of the system is conserved, so that 

Ti + 0 = Ti' + T 2 ' (T 2 = 0 because P 2 = 0) . (4.37) 

Substituting Ti = p, 2 /2m, , and rearranging, gives 
(P27pi7=(m2/mi)(x 2 -1). 

We therefore obtain a quadratic equation in x: 

x 2 + 2 x(mi/(m 2 - mi))cos0 - [(m 2 + mi)/(m 2 - mi)] = 0 . 

Tine valid solution of this equation is 

x = (T 1 /T 1 ') 1C = - (mi/(m 2 - mi))cos0 

+ {(mi/(m 2 - mi))2cos 2 0 + [(m 2 + mi)/(m 2 - mi)]} 1,2 . (4.38) 

If mi = m 2 , the solution is x = 1/cos0, in which case 

Ti'=TiCOs20. (4.39) 

In the frame in which P 2 = 0, a geometrical analysis of the two-body collision is useful. We 


have 
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pi + (-pi') = p 2 ', (4.40) 



Pl' = PlCOS0. 


In this case, the two particles always emerge from the elastic collsion at right angles to each other 
(0 + 4> — 90°). 

In the early 1930’s, the measured angle between two outgoing high-speed nuclear particles of 
equal mass was shown to differ from 90°. Such experiments dearly demonstrated the breakdown of 
Newtonian dynamics in these interactions. 

4.6.2 Inelastic collisions 

Collisions between everyday objects are never perfectly elastic. An object that has an internal 
structure can undergo inelastic collisions involving changes in its structure. Inelasticcollisions are found 
to obey two laws; they are 

1 ) the conservation of linear momentum 
and 


2) an empirical law, due to Newton, that states that the relative velocity of the colliding objects, 
measured along their line of centers immediately after impact, is -e times their relative velodty 
before impact. 
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Tine quantity e is called the coefficient of restitution. Its value depends on the nature of the materials of 
the colliding objects. For very hard substances such as steel, e is close to unity, whereas for very soft 
materials such as putty, e approaches zero. 

Consider , in the simplest case, the impact of two deformable spheres with masses mi and 
m 2 . Let their velocities be vi and V2, and vi ' and \ii (along their line of centers) before and after impact, 
respectively. The linear momentum is conserved, therefore 
miVi + m2V2=mivT + m2V2' 
and, using Newton’s empirical law, 

vT-v 2 '=-e(vi-V 2 ). (4.41) 

Rearranging these equations, we can obtain the values vT and \ii after impact , in terms of their values 
before impact: 

vi ' = [mivi + m 2 V 2 - em 2 (vi - V 2 )]/(mi + m 2 ), (4.42) 

and 

mi = [mivi + 1 TI 2 V 2 + emi(vi - V 2 )]/(mi + m 2 ) . (4.43) 

If the two spheres initially move in directions that are not colinear, the above method of analysis 
is still valid because the momenta can be resolved into components along and perpendicular to a 
chosen axis. The perpendicular components remain unchanged by the impact. 

We shall find that the classical approach to a quantitative study of inelastic collisions must be 
radically altered when we treat the subject within the framework of Special Relativity. It will be shown 
that the combined mass (mi + m 2 ) of the colliding objects is not conserved \n an inelastic collision. 
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4.7 The motion of rigid bodies 

Newton’s Laws of Motion apply to every point-like mass in an object of finite size. The smallest 
objects of practical size contain very large numbers of microscopic particles — Avogadro’s number is 
about 6 x 10 23 atoms per gram-atom. The motions of the individual microscopic particles in an 
extended object can be analyzed in terms of the motion of their equivalent total mass, located at the 
center of mass of the object. 

4.7.1 The center of mass 

For a system of discrete masses, m, located at the vector positions, r, the position tcm of the 
center of mass is defined as 

rcM = Xi nw / D m = X mn / M, where M is the total mass. (4.44) 

The center of mass (CM) of an (idealized) continuous distribution of mass with a density p 
(mass/volume), can be obtained by considering an element of volume dV with an elemental mass dm. 
We then have 

dm = pdV. (4.45) 

The position of the CM is therefore 

r CM = (1/M)jrdm = (1/M)jrpdV. (4.46) 

The Cartesian components of rcM are 

x™=(1/l#xpdV. (4.47) 

In non-uniform materials, the density is a function of r. 

4.7.2 Kinetic energy of a rigid body in general motion 
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Consider a rigid body that has both translational and rotational motion in a plane. Let the 
angular velocity, go, be constant. At an arbitrary time ,t, we have 



Let the coordinates of an element of mass m of the body be [x, y] in the fixed frame (origin 0) and 
[x', y] in the frame moving with the center of mass, G (origin O'), and let u and v be the components of 
velocity of G, in the fixed frame. For constant angular velocity go, the instantaneous velocity of the 
element of mass m, relative to G has a direction perpendicular to the radius vectorr', and a magnitude 
v'=r'co. (4.48) 

Tine components of the instantaneous velocity of G, relative to the fixed frame, are 
uin the x-direction, and 
v in the y-direction. 

Tine velocity components of m in the [x, yj-frame are therefore 
u - reusing' = u - y 'go in the x-direction, 
and 


v + r'cooos(|>' = v + x'go in the y-direction. 
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Tine kinetic energy of the body, Ek, of mass M is therefore 

Ek = (1/2)Zm{(u - y co) 2 + (v + x'oo) 2 } (4.49) 

= (1/2)M(u2 + v 2 ) + (1/2)o> 2 Xm(x' 2 + y' 2 ) 

-ucD^my'+voalnix'. 

Therefore 

Ek=(1/2)Mvg2 + (I^Igcd 2 , (4.50) 

where 

vg = (u 2 + v 2 ) 1/2 the speed of G, relative to the fixed frame, 

Xmy' = Xmx' = 0, by definition of the center of mass, 
and 

Ig = Xm(x' 2 + y' 2 ) = Xmr' 2 , is called the moment of imeiiia of M about an axis 
through G, perpendicular to the plane. 

We see that the total kinetic energy of the moving object of mass M is made up of two parts, 

1 ) the kinetic energy of translation of the whole mass moving with the velocity of 
the center of mass, 

and 

2) the kinetic energy of rotation of the whole mass about its center of mass. 
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4.8 Angular velocity and the instantaneous center of rotation 

The angular velocity of a body is defined as the rate of increase of the angle between any line 
AB, fixed in the body, and any line fixed in the plane of the motion. If<|> is the instantaneous angle 
between AB and an axis Oy, in the plane, then the angular velocity is cfy/dt. 

Consider a circular disc of radius a, that rolls without sliding in contact with a line Ox, and letcj) 
be the instantaneous angle that the fixed line AB in the disc makes with the y-axis. At t = 0, the rolling 

begins with the point B touching the origin, 0: 

A 

y 
y 

0 x P (corresponds to (j)=0) x 

At time t, after the rolling begins, the coordinates of B[x, y] are 

x = OP — asinc)) = BP — asincj) = a4> — asincj) = a((j> - sincp), 
and 

y = AP - acoscj) = a(1 - coscj)). 

The components of the velocity of B are therefore 

v x = dx/dt = a(dcj)/dt)(1 - coscj)), (4.51 ) 

and 

v y = dy/dt = a(dcj)/dt)sincj). (4.52) 



The components of the acceleration of B are 
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a x = dvVdt = (d/dt)(a(d(|)/dt)(1 -cose)))) (4.53) 

= a(dcj)/dt) 2 sin(t) + a(1 - cos^Xd^/dt 2 ), 
and 

ay = dv/dt = (d/dt)(a(dcj)/dt)sin(t)) (4.54) 

= a(d4>/dt)2(X)ScJ) + asincj)(d 2 ct)/dt 2 ). 

14=0, 

dx/dt = 0 and dy/dt = 0, which means that the point P has no instantaneous velocity. 
The point B is therefore instantaneously rotating about P with a velocity equal to 2asin(cj)/2)(dcj)/dt); P is 
a “center of rotation”. 

Also, 

d 2 x/dt 2 = 0 and d 2 y/dt 2 = a(dc|Vdt) 2 , the point of contact only has an acceleration 
towards the center. 

4.9 An application of the Newtonian method 

The following example illustrates the use of some basic principles of classical dynamics, such 
as the conservation of linear momentum, the conservation of energy, and instantaneous rotation about 
a moving point: 

Consider a perfectly smooth, straight horizontal rod with a ring of mass M that can slide along 
the rod. Attached to the ring is a straight, hinged rod of length L and of negligible mass; it has a mass m 
at its end. At time t = 0, the system is held in a horizontal position in the constant gravitational field of the 


Earth. 
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Att=0: 



m L iyi 

x = 0 at t = 0 


At t = 0, the mass m is released and falls under gravity. At time t, we have 



LcoscKd^/dt) L(d(|Vdt) = instantaneous velocity of m 
about M 

There are no external forces acting on the system in the x-direction and therefore the horizontal 
momentum remains zero: 

M(dx/dt) + m((dx/dt) - Lsin^dt)) = 0. (4.56) 

Integrating, we have 

Mx + mx + mLcosc|) = constant. (4.57) 

If x = 0 and cj) = 0 at t = 0, then 

ml_= constant, (4.58) 

therefore 


(M + m)x+mL(cos(|)-1) = 0 i 
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so that 

x = ml_(1 - cos(|))/(M + m). (4.59) 

We see that the instantaneous position x(t) is obtained by integrating the momentum equation. 

The equation of conservation of energy can now be used; it is 
(IW2)v^ + (m/2)(v x - Lsin^/dt)) 2 + (m/2)(Lcosc|)(dcjVdt)) 2 = mgLsincp. 

(The change in kinetic energy is equal to the change in the potential energy). 

Rearranging, gives 

(M + m)Vx2 - 2mLsin(j)Vx(d(|)/dt) + (mL 2 (dc|Vdt) 2 - 2mgLsincJ)) = 0. (4.40) 

This is a quadratic in v x with a solution 

(M + m)v x = mLsin<Kdc(>/dt)[1 ± {1 - [(M + m)(mLWdtF 
- 2mLgsin(|))]/[m 2 L 2 (d(!Vdt) 2 sin 2 ct)]} 1 ' 2 ]. 

The left-hand side of this equation is also given by the momentum equation: 

(M + m)v x = mLsin(|)(d(|)/dt). 

We therefore obtain, after substitution and rearrangement, 

dcp/dt = {j2(M + m)gsinc|)]/[L(M + mcos 2 ^)]} 12 , (4.41 ) 

the angular velocity of the rod of length L at time t. 

PROBLEMS 

4-1 A straight uniform rod of mass m and length 21 is held at an angle 0o to the vertical. 

Its lower end rests on a perfectly smooth horizontal surface. The rod is released and 
falls under gravity. At timet after the motion begins, we have 
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If the moment of inertia of the rod about an axis through its center of mass, 
perpendicular to the plane of the motion, is mP/3, prove that the angular velocity of 
the rod when it makes an angle 0 with the vertical, is 

d0/dt = {6g(cos0o - cos0)/l(1 + 3sin 2 0)} 1/2 . 

4-2 Show that the center of mass of a uniform solid hemisphere of radius R is 3R/8 above 
the center of its plane surface. 

4-3 Show that the moment of inertia of a uniform solid sphere of radius R and mass M 
about a diameter is 2MR 2 /5. 

44 A uniform solid sphere of mass m and radius r can roll, under gravity, on the inner surface of a 
perfectly rough spherical surface of radius R. The motion is in a vertical plane. 

At time t during the motion, we have 
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Show that 

d 2 0/dt 2 + [5g/(7(R- r))]sin0 = 0. 

As a preliminary result, show that rto = (R - r)(d0/dt) for rolling motion without slipping. 
4-5 A particle of mass m hangs on an inextensible string of length I and negligible 
mass. The string is attached to a fixed point 0. fine mass oscillates in a vertical plane 
under gravity. At time t, we have 



Show that 

1) d 2 0/dt 2 + (g/l)sinO = 0. 

2) go 2 = (2g/l)[cos0 - cos0o], where 0o is the initial angle of the string with respect 

to the vertical, so that go = 0 when 0 = 0o. This equation gives the angular velocity 
in any position. 

4-6 Let b be the natural length of an elastic string fixed at the point 0. The string has a 
negligible mass. Let a mass m be attached to the string, and let it stretch the string 
until the equilibrium position is reached. The tension in the string is given by Hooke’s 
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law: 

Tension, T = X(extension)/original length, where X is a constant for a given material. 
The mass is displaced vertically from its equilibrium position, and oscillates under 
gravity. We have 


0 


Equilibrium General position 


Ye 


gll 


y(t) 


mg 


Ing 


Show that the mass oscillates about the equilibrium position with simple harmonic 
motion, and that 

y(t) = b + (mglc/>,){1 - cos[t Vx/mlo]} (starts with zero velocity at y(0) = b) 

4-7 A dynamical system is in stable equilibrium if the system tends to return to its original 
state if slightly displaced. A system is in a position of equilibrium when the height of 
its center of gravity is a maximum or a minimum. Consider a rod of mass m with one 
end resting on a perfectly smooth vertical wall OA and the other end on a perfectly 
smooth inclined plane, OB. Show that, in the position of equilibrium 
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cot0 = 2tancj), where the angles are given in the diagram: 



Find y = f(0), and show, by considering derivatives, that this is a state of unstable 
equilibrium. 

4-8 A particle A of mass rriA = 1 unit, scatters elastically from a stationary particle B of mass 
rriB = 2 units. If A scatters through an angle 0, show that the ratio of the kinetic 
energies of A, before (Ta) and after (TV) scattering is 
(TVTV) = (-cos0 + V3 + cos 2 0) 2 . 

Sketch the form of the variation of this ratio with angle in the range 0 < 0 < tl 
(This problem is met in practice in lowenergy neutron-deuteron scattering). 
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5 

INVARIANCE PRINCIPLES AND CONSERVATION LAWS 

5.1 Invariance of the potential under translations and the conservation of 
linear momentum 

The equation of motion of a Newtonian particle of mass m moving along the x-axis under the 
influence of a force F x is 

mdW=F x . (5.1) 

If F x can be represented by a potential V(x) then 

md 2 x/dt 2 =-dV(x)/dx. (5.2) 

In the special case in which the potential is not a function of x, the equation of motion becomes 
md 2 x/dt 2 =0, 
or 

md(v x )/dt=0. (5.3) 

Integrating this equation gives 

mv x = constant. (5.4) 

We see that the linear momentum of the particle is constant if the potential is independent of the 
position of the particle. 

5.2 Invariance of the potential under rotations and the conservation of angular 


momentum 
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Let a Newtonian particle of mass m move in the plane about a fixed origin, 0, under the 
influence of a force F. The equations of motion, in the x-and y-directions, are 

mcPx/dt 2 = F x and md 2 y/dt 2 = F y . (5.5 a,b) 

If the force can be represented by a potential V(x, y) then we can write 

mcPx/dt 2 = -3V/3x and mdydt 2 = -3V/3 y . (5.6 a,b) 

The total differential of the potential is 

dV = (dV/dx)dx + (dV/dy)dy. 

Let a transformation from Cartesian to polar coordinates be made using the standard linear equations 
x = rc»S(|) and y = rsincj) . 

The partial derivatives are 

3x/3(|) = -rsfo(|) = -y, 3x/3r = cos4>, dy/ety = rcosc|>= x, and 3y/3r = sincp . 

We therefore have 

dm j) = (3V/3x)(3x/a (>) + (3V/3y)(3y/at>) (5.7) 

= (3V/3x)(-y) + (3V/3y)(x) 

= yF x + x(-F y ) 

= m(yax ~ xa y ) (a x and a y are the components of acceleration) 

= m(d/dt)(yv x - xv y ) (v x and v y are the components of velocity). 

If the potential is independent of the angle c|> then 

3V/3(j)=0, (5.8) 


in which case 



107 


m(d/dt)(yvx-xv y )=0 
and therefore 

m(yv x - xv y ) = a constant. (5.9) 

The quantity on the left-hand side of this equation is the angular momentum (yp< - xpy) of the mass 
about the fixed origin. We therefore see that if the potential is invariant under rotations about the origin 
(independent of the angle 4>), the angular momentum of the mass about the origin is conserved. 

In Chapter 9 , we shall treat the subject of invariance principles and conservation laws in a 
more general way, using arguments that involve the Lagrangians and Hamiltonians of dynamical 
systems. 
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6 

EINSTEINIAN DYNAMICS 


6.1 4-momentum and the energy-momentum invariant 

In Classical Mechanics, the concept of momentum is important because of its role as an invariant in 
an isolated system. We therefore introduce the concept of ^momentum in Relativistic Mechanics in 
order to find possible Lorentz invariants involving this new quantity. The contravariant 4momentum is 
defined as: 

R J = mVr j (6.1) 

where m is the mass of the particle. (It is a Lorentz scalar— the mass measured in the rest frame of 
the particle). 

The scalar product is 

PMP M = (mc)2. (6.2) 

Now, 

P^ = [myc, myVN] (6.3) 

therefore, 

P^ J P M = (mYC) 2 -(mYVN) 2 . 

Writing 

M = ym, the relativistic mass, we obtain 

P^P M = (Me) 2 - (Mv n ) 2 = (me) 2 . (6.4) 

Multiplying throughout by & gives 
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M 2 c 4 -M 2 v N 2 c 2 =m 2 c 4 . (6.5) 

Tine quantity Me 2 has dimensions of energy; we therefore write 

E = Me 2 , (6.6) 

the total energy of a freely moving particle. 

This leads to the fundamental invariant of dynamics 

c 2 P% = E 2 -(pc) 2 = E 02 (6.7) 

where 

E° = me 2 is the rest energy of the particle, and p is its relativistic 3-momentum. 

The total energy can be written: 

E=yE°=E°+T, (6.8) 

where 

T=E°( y -1), (6.9) 

the relativistic kinetic energy. 

The magnitude of the ^momentum is a Lorentz invariant 

|Pm| = me. (6.10) 

fine 4- momentum transforms as follows: 

P'm = |_ p(j_ (6.11) 

6.2 The relativistic Doppler shift 

For relative motion along the x-axis, the equation P' u = LP M is equivalent to the equations 


E'= yE“(3ycp x 


( 6 . 12 ) 
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and, 

cp' x =-f 3 >yE + yep". (6.13) 

Using the Planck-Einstein equations E = hv and E = pc for photons, the energy equation becomes 
v'=yv-|3yv 

=yv(i-|3) 

= v(1 - (3)/(1 - 13 2 ) 12 

= v{(1-|3)/(1 + |3r (6.14) 

This is the relativistic Doppler shift for photons of the frequency v', measured in an inertial frame 
(primed) in terms of the frequency v measured in another inertial frame. 

6.3 Relativistic collisions and the conservation of 4-momentum 

Consider the interaction between two particles, 1 and 2, to form two particles, 3 and 4. (3 and 
4 are not necessarily the same as 1 and 2). The contravariant 4-momenta are P," : 

1 

1+2 3+4 

All experiments are consistent with the fact that the 4-momentum of the system is conseived. 
We have, for the contravariant 4-momentum vectors of the interacting particles, 




Ill 


Pf + P 2 " = P 3 " + P 4 !l (6.15) 

~ T ~ T 

initial “free” state final “free” state 
and a similar equation for the covariant 4-momentum vectors, 

Vfy = V P V (6.16) 

If we are interested in the change Pf -»> Rf , then we require 

Pi l -Pi l =Pf-Pf (6.17) 

and 

Pv-P3, = Pv-P^, (6.18) 

Forming the invariant scalar products, and using RJT = (EP/c) 2 , we obtain 
(Ei 0 /c)2 - 2(Ei Es/c 2 - pi -p 3 ) + (E3°/c) 2 

= (E 4 °/c) 2 - 2(E 2 E/c 2 - p 2 -p4) + (E 2 °/c) 2 . (6.19) 

Infrodudng the scattering angles, 0 and <|>, this equation becomes 


Ei 02 — 2(Ei E 3 — c 2 pi p 3 cos0) + E 3 02 = E 2 02 - 2(E 2 E4 - c 2 p 2 p 4 Coscp) + E 4 02 . 

If we choose a reference frame in which particle 2 is at rest (the LAB frame), then p 2 = 0 and 
E 2 = E 2 °, sothat 

Ei O2 -2(EiE 3 -c 2 pip 3 cos0) + E 3 ° 2 = E 2 ° 2 -2E 2 °E4 + E4 02 . (6.20) 

fine total energy of the system is conserved, therefore 


Ei + E 2 - E 3 + E4 - Ei + E 2 ° 


( 6 . 21 ) 
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or 

E4 = Ei + E2 0 — E3 

Eliminating E 4 from the above “scalar product equation gives 

E 1 02 - 2(EiE 3 - tfpipscose) + E 3 02 = E 4 02 - E 2 02 -2E 2 0 (Ei - Es). (6.22) 
This is the basic equation for all interactions in which two relativistic entities in the initial state interact to 
give two relativistic entities in the final state. It applies equally well to interactions that involve massive 
and massless entities. 

6.3.1 The Compton effect 

The general method discussed in the previous section can be used to provide an exact 
analysis of Compton’s famous experiment in which the scattering of a photon by a stationary, free 
electron was studied. In this example, we have 

Ei = Eph (the incident photon energy), E 2 = Ee° (the rest energy of the stationary electron, the 
“target’), E 3 = Eph' (the energy of the scattered photon), and E 4 = E e ' (the energy of the recoiling 
electron). The “rest energy” of the photon is zero: 



The general equation (6.22), is now 

0 - 2(EphEph' - EphEph'cosG) = Ee 02 -2Ee°(E^ + Ee°- Eph') + E e 02 


(6.23) 
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or 


- 2 EphEph '(1 - cos0) — 2 Ee°(Eph - Eph') 


so that 


Eph - Eph - EphEph (1 — COS0)/Ee° . 


(6.24) 


Compton measured the energy-loss of the photon on scattering and its cos0-dependence. 

6.4 Relativistic inelastic collisions 

We shall consider an inelastic collision between a particle 1 and a particle 2 (initially at rest) to 
form a composite particle 3. In such a collision, the 4-momentum is conserved (as it is in an elastic 
collision) however, the kinetic energy is not conserved. Part of the kinetic energy of particle 1 is 
transformed into excitation energy of the composite particle 3. This excitation energy can take many 
forms— heat energy, rotational energy, and the excitation of quantum states at the microscopic level. 
The inelastic collision is as shown: 


Before 


After 


1 2 3 


• ► o ©— ► 

Pi P2 = 0 P3 


Rest energy: Ei° E2 0 E3 0 

Total energy: Ei E2=E2° E3 

3-momentum: pi p 2 = 0 p3 

Kinetic energy: Ti T 2 = 0 T3 


In this problem, we shall use the energy-momentum invariants associated with each particle, directly: 
i) Ei 2 -(pic) 2 =Ei 02 (6.25) 
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ii) E2 2 =E 2 ° 2 

(6.26) 

iii) E^-(p 3 c) 2 =E 3 02 . 

Tine total energy is conserved, therefore 

(6.27) 

Ei + E 2 = E3 = Ei + E2 0 . 

Introducing the kinetic energies of the particles, we have 

(6.28) 

(Ti + Ei°) + E2 0 = T3 + E3 0 . 

The 3-momentum is conserved, therefore 

(6.29) 

pi + 0 = P3 . 

Using 

(6.30) 

E3 02 =E3 2 -(P3C) 2 , 

we obtain 

E 3 02 = (El + E2°) 2 -(P3C) 2 

= Ei 2 + 2EiE 2 ° + E2 02 - (pic ) 2 
= Ei° 2 + 2EiE 2 ° + E 2 ° 2 

(6.31) 

= E-i 0 2 + E 2 o 2 + 2 (T -i + Ei°)E 2 ° 
or 

(6.32) 

E3 0 2 = (Ei° + E2 0 ) 2 + 21 1 E 2 ° (E3 0 >Ei° + E 2 0 ). 
Using Ti = yiEi 0 - Ei°, where yi = (1 - | 3 i 2 )- 1c and |3i = vi/c, we have 

(6.33) 

E3O 2 = Ei° 2 + E 2 ° 2 + 2 yi Ei°E 2 ° . 

If two identical particles make a completely inelastic collision then 

(6.34) 
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E3 02 = 2( Y i + 1)Ei 02 . (6.35) 

6.5 The Mandelstam variables 

In discussions of relativistic interactions it is often useful to introduce additional Lorentz 
invariants that are known as Mandelstam variables. They are, for the special case 
of two particles in the initial and final states (1 + 2 -* 3 + 4): 

s = (Pi 11 + P 2 ! ')[Pi L( + P 2 J, the total 4-momentum invariant 
= ((Ei + E 2 )/c, (pi + p 2 ))[(Ei + E 2 )/c, -(pi + P 2 )] 

= (Ei + E 2 ) 2 /c 2 - (pi + P 2 ) 2 , a Lorentz invariant , (6.36) 

t = (Pf— PslPi^— P 3 J, the 4-momentum transfer (1-*3) invariant 
= (Ei - E 3 ) 2 /c 2 - (pi - P 3 ) 2 , a Lorentz invariant, (6.37) 

and 

u = (Pi" - P 4 ")[Pi m - P 4 J, the 4-momentum transfer (1— >4) invariant 
= (Ei - E 4 ) 2 /c 2 - (pi - p4 f, a Lorentz invariant. (6.38) 

Now, 

sc 2 = Ei 2 + 2 EiE 2 + E 2 2 - (pi 2 + 2pi-p 2 + p 2 2 )c 2 
= Ei° 2 + E 2 ° 2 + 2Ei E 2 - 2pi -p2C 2 

= Ei° 2 + E 2 ° 2 + 2(Ei, P ic)[E 2 , -P2C], (6.39) 

T 

Lorentz invariant 
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Tine Mandelstam variable sc? has the same value in all inertial frames. We therefore evaluate 
it in the LAB frame, defined by the vectors 

[EiK Pl Lc] and [E 2 L = E 2 °, -p 2 k; = 0], (6.40) 

so that 

2(Ei l E 2 l — pi L -p 2 L c 2 ) = 2 Ei l E 2 °, (6.41) 

and 

sc 2 = Ei° 2 + E 2 ° 2 + 2 Ei l E 2 °. (6.42) 

We can evaluate sc 2 in the center-of -mass (CM) frame, defined by the condition 
pi™ + Jp 2 m = 0 (the total 3-momentum is zero): 

sc2=( Ei cm +E2 cm)2 ( 643 ) 

This is the square of the total CM energy of the system. 

6.5.1 The total CM energy and the production of new particles 

The quantity cVs is the energy available for the production of new particles, or forexciting the 
internal structure of particles. We can now obtain the relation between the total CM energy and the 
LAB energy of the incident particle (1) and the target (2), as follows: 

sc 2 = Ei 02 + E 2 02 + 2EfE 2 ° = (Ei™ + E 2 ™) 2 = W 2 , say. (6.44) 

Here, we have evaluated the left-hand side in the LAB frame, and the right-hand side in the CM frame. 

At very high energies, cVs » Ei° and E 2 °, the rest energies of the particles in the initial state, in 
which case, 


W 2 = stf «2E 2 l E 2 °. 


(6.45) 



117 


Tine total CM energy, W, available for the production of new particles therefore depends on the square 
root of the incident laboratory energy. This result led to the development of colliding, or intersecting, 
beams of particles (such as protons and anti-protons) in order to produce sufficient energy to generate 
particles with rest masses greater than 1 00 times the rest mass of the proton (=1 0 9 eV). 

6.6 Positron-electron annihilation-in-flight 

A discussion of the annihilation-in-flight of a relativistic positron and a stationary electron 
provides a topical example of the use of relativistic conservation laws. This process, in which two 
photons are spontaneously generated, has been used as a source of nearly monoenergetic high- 
energy photons for the study of nuclear photo-disintegration since 1 960. The general result for a 1 + 2 
-^>3 + 4 interaction, given in section 6. 3, provides the basis for an exact calculation of this process; we 
have 

Ei = Epos (the incident positron energy), E2 = Ee° (the rest energy of the stationary electron), E3 
= Ephi (the energy of the forward-going photon), and E 4 = Eph 2 (the energy of the backward-going 
photon). The rest energies of the positron and the electron are equal. Tine general equation (6.22), 
now reads 

Ee 02 - 2{EposEphi - cpposEphi(cos0)} + 0 = 0 - Ee 02 -2Ee°(Epos - Ephi) (6.46) 

therefore, 

Ephl{Epos + Ee 0- [Epos 2 — EeT COS0} = (Ep* + Ee°)Ee°, 

giving 


Eftii=Ee°/(1-kCOS0) 


(6.47) 
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where 

k = [(Epos-Ee°)/(Epos+EeT- 

Tine maximum energy of the photon, Ephi 1 ™ occurs when 0 = 0, corresponding to motion in 
the forward direction; its energy is 

Ephi max =E oe /(1-k). (6.48) 

If, for example, the incident total positron energy is 30 MeV, and Ee° = 0.51 1 MeV then 
E P hi max = 0.511/[1 — (29.489 / 30.51 1 ) 1/2 ] MeV 
= 30.25 MeV. 

The forward-going photon has an energy equal to the kinetic energy of the incident positron 
(Ti =30— 0.51 1 MeV) plus approximately three-quarters of the total rest energy of the positron-electron 
pair (2Ee° = 1 .02 MeV). Using the conservation of the total energy of the system, we see that the 
energy of the backward-going photon is approximately 0.25 MeV. 

The method of positron-electron annihilation-in-flight provides one of the very few ways of generating 
nearly monoenergetic photons at high energies. 

PROBLEMS 

6-1 A particle of rest energy E° has a relativistic 3-momentum p and a relativistic kinetic 
energy T. Show that 

1) |p| = (1/c)(2TE°H1 + (T/2E°)P, 
and 

2) |v| = c{1 + [E 02 ^ + 2E°)]}- 1/2 , where v is the 3-velocity. 
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6-2 Two similar relativistic particles, A and B, each with rest energy EP, move towards 
each other in a straight line. The constant speed of each particle , measured in the 
LAB frame is V = p>c. Show that their total energy, measured in the rest frame of A, is 
E°(1 + p, 2 )/(1-p,2). 

6-3 An atom of rest energy Ea° is initially at rest. It completely absorbs a photon of energy 
Eph, and the excited atom of rest energy Ea°* recoils freely. If the excitation energy of 
the atom is given by 

E 0 ( =Ea o *-Ea 0 , show that 

E* = -Ea° + Ea°{1 + (2E^Ea°F, exactly. 

If, as is often the case, Eph « Ea°, show that the recoil energy of the atom is 
Eiecoi ~ Eph^2EA 0 - 

Explain how this approximation can be deduced using a Newtonian-like analysis. 

64 A completely inelastic collision occurs between particle 1 and particle 2 (initially at 
rest ) to form a composite particle, 3. Show that the speed of 3 is 
V 3 = Vl/{1+(E2°/El)}, 

where vi and Ei are the speed and the total energy of 1 , and E 2 0 is the rest energy of 2. 

6-5 Show that the minimum energy that a y-ray must have to just break up a deuteron 
into a neutron and a proton isy™ * 2.23 MeV, given 
Eneut°= 939.5656 MeV, 


Eprot 0 = 938.2723 MeV, and 
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Edeut 0 = 1 875.61 34 MeV. 

6-6 In a general relativistic collision: 

1 + 2 n-particles 

-> (3 + 4 + ...m) + (m+1 , m+2. + ...n) 

where the particles 3 -» m are “observed”, and the particles m+l^-n are 
“unobserved”. We have 

Ei + E 2 = (E3 + E4 + ...E m ) + (Em+i + Em+2 + ...E n ), the total energy, 

— £obs+ £unobs 

and 

pi + p 2 = p obs + p unobs . 

If W un o bs / C 2 is the unobserved (missing) mass of the particles m+1 to n, show that, in 
the LAB frame 

(VW°bs)2 = (Ef + E 2 ° ~ Z0=3,m] Ei L ) 2 - (Plk) - CX[=3,m] P L ) 2 . 

This is the missing (energy) 2 in terms of the observed quantities. This is the principle 
behind the socalled “missing-mass spectrometers” used in Nuclear and Particle 
Physics. 

6.7 If the contravariant 4-force is defined as P = dPVdx = [f°, fl where x is the proper time, and P l is the 
contravariant4momentum, show that 

F V,, = 0, where V u is the covariant 4-velocity. (The 4 force and the 4velocity are orthogonal). 
Obtain dE/dt in terms ofy, v, and f. 
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7 

NEWTONIAN GRAVITATION 

We come now to one of the highlights in the history of intellectual endeavor, namely Newton’s 
Theory of Gravitation. This spectacular work ranks with a handful of masterpieces in Natural 
Philosophy — the Galileo-Newton Theory of Motion, the Camot-Clausius-Kelvin Theory of Heat and 
Thermodynamics, Maxwell’s Theory of Electromagnetism, the Maxwell-Boltzmann-Gibbs Theory of 
Statistical Mechanics, Einstein’s Theories of Special and General Relativity, Planck’ s Quantum Theory 
of Radiation, and the Bohr-deBroglie-Schrodinger-Heisenberg Quantum Theory of Matter. 

Newton’s most significant ideas on Gravitation were developed in his early twenties at a time 
when the University of Cambridge closed down because of the Great Plague. He retimed to his 
home, a farm at Woolsthorpe-by-Colsterworth, in Lincolnshire. It is a part of England dominated by 
vast, changing skies; a region buffeted by the winds from the North Sea. The thoughts of the young 
Newton naturally turned skyward — there was little on the ground to stir his imagination except 
perhaps, the proverbial apple tree and the falling apple. Newton’s work set us on a new course. 

Before discussing the details of the theory, it will be useful to give an overview using the 
simplest model, consistent with logical accuracy. In this way, we can appreciate Newton’s radical 
ideas, and his development of the now standard “Scientific Method “ in which a crucial interplay exists 
between the results of observations and mathematical models that best account for the observations. 
The great theories are often based upon relatively small numbers of observations. The uncovering of 
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the Laws of Nature requires deep and imaginative thoughts that go far beyond the demonstration of 
mathematical prowess. 

Newton’s development of Differential Calculus in the late 1 660’s was strongly influenced by his 
attempts to understand, analytically, the empirical ideas concerning motion that had been put forward 
by Galileo. In particular, he investigated the analytical properties of motion in curved paths. These 
properties are required in his Theory of Gravitation. We shall consider motion in 2-dimensions. 

7.1 Properties of motion along curved paths in the plane 

The velocity of a point in the plane is a vector, drawn at the point, such that its component in 
any direction is given by the rate of change of the displacement, in that direction. Consider the following 
diagram 



P and Q are the positions of a point moving along the curved path AB. The coordinates are P [x, y] at 
time t and Q [x + Ax, y + Ay] at time t + At The components of the velocity of the point are 
lim(At— >0) Ax/At = dx/dt = v x , 


and 
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lim(At— >0) Ay/At = dy/dt =v y . 

Ax and Ay are the components of the vector PQ. The velocity is therefore 
lim(AH>0) chordPQ/At . 

We have 

lim(Q-»P) chord PQ/As = 1 , 

where s is the length of the curve AP and As is the length of the arc PQ. 

Tine velocity can be written 

lim(AH>0) (chordPQ/As)(As/At) = ds/dt. (7.1) 

fine direction of the instantaneous velocity at P is along the tangent to the path at P. 
fine x- and y-components of the acceleration of P are 
im(At-*0) AvjAi = dv>/dt = d 2 x/dt 2 , 
and 

lim(AH>0) AVy/At = dvy/dt = d 2 y/dt 2 . 
fine resultant acceleration is not directed along the tangent at P. 

Consider the motion of P along the curve APQB: 
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Tine change Av in the vectorv is shown in the diagram: 

Av 

The vector Av can be written in terms of two components, a, perpendicular to the direction ofv, and b, 
along the direction ofv + Av: The acceleration is 
lim(At-»0) Av/At, 

Tie component along a is 

lim(At— >0) Aa/At = lim(At->0) vAip/At = lim(At-»0) (vAip/As)(As/At) 

=v 2 (dip/ds)=v 2 /p (7.2) 

where 

p = ds/dip, is the radius of curvature at P. (7.3) 

The direction of this component of the acceleration is along the inward normal at P. 

If the particle moves in a circle of radius R then ts acceleration towards the center is v 2 /R, a result first 
given by Newton. 

The component of acceleration along the tangent at P is dv/dt = v(dv/ds) = cPs/dt 2 . 

7.2 An overview of Newtonian gravitation 

Newton considered the fundamental properties of motion, embodied in his three Laws, to be 
universal in character— the natural laws apply to all motions of all particles throughout all space, at all 
times. Such considerations form the basis of a Natural Philosophy. In the Principia, Newton wrote . . .”1 
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began to think of gravity as extending to the orb of the Moon. . He reasoned that the Moon, in its 
steady orbit around the Earth, is always accelerating towards the Earth. He estimated the acceleration 
as follows: 

If the orbit of the Moon is circular (a reasonable assumption), the dynamical problem is 



Moon 

The acceleration of the Moon 
towards the Earth is 
|a R |= v 2 /R 


Newton calculated v = 2ttR fT, where R =240,000 miles, and T = 27.4 days, the period, 
so that 

Sr - 4tt 2 R/T 2 ~ 0.007 ft/sec 2 . (7.4) 

He knew that all objects, close to the surface of the Earth, aocelerate towards the Earth with a value 
determined by Galileo, namely g * 32 fl/sec 2 . He was therefore faced with the problem of explaining 
the origin of the very large difference between the value of the acceleration 3r, nearly a quarter of a 
million mites away from Earth, and the local value, g. 

He had previously formulated his 2nd Law that relates force to acceleration, and therefore he 
reasoned that the difference between the accelerations, 8r and g, must be associated with a property 
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of the force acting between the Earth and the Moon — the force must decrease in some unknown 
way. 

Newton then introduced his conviction that the force of gravity between objects is a universal 
force; each planet in the solar system interacts with the Sun via the same basic force, and therefore 
undergoes a characteristic acceleration towards the Sun. He concluded that the answer to the 
problem of the nature of the gravitational force must be contained in the three empirical Laws of 
Planetary Motion announced by Kepler, a few decades before. The three Laws are 

1 ) The planets describe ellipses about the Sun as focus, 

2) The line joining the planet to the Sun sweeps out equal areas in equal intervals 

of time, 

and 

3) The period of a planet is proportional to the length of the semi-major axis of 

the orbit raised to the power of 3/2. 

These remarkable Laws were deduced after an exhaustive study of the motion of the planets, 
made over a period of about 50 years by Tycho Brahe and Kepler. 

The 3rd Law was of particular interest to Newton because it relates the square of the period to 
the cube of the radius for a circular orbit: 

T 2 oc R3 (7.5) 

or 


T 2 = CR 3 , 
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where C is a constant. He replaced the specific value of (RTF) that occurs in the expression for the 
acceleration of the Moon towards the Earth with the value obtained from Kepler's 3rd Law and 
obtained a value for the acceleration aR: 

aR = v 2 /R = 4 tt 2 R/P (Newton) (7.6) 


but 


R/P = 1/CR 2 (Kepler) 

therefore 


(7.7) 


a R = 4 tt 2 (R/T 2 ) 

= (4tt 2 /C)(1/R 2 ) (Newton). (7.8) 

Tine acceleration of the Moon towards the Earth varies as the inverse square of the distance between 
them. 


Newton was now prepared to develop a general theory of gravitation. If the acceleration of a planet 
towards the Sun depends on the inverse square of their separation, then the force between them can 
be written, using the 2nd Law of Motion, as follows 

F = Mplanet 3planet = Mp|anet(4TI 2 /C)(1/R 2 ). (7.9) 

At this point, Newton introduced the first symmetry argument in Physics: if the planet 
experiences a force from the Sun then the Sun must experience the same force from the planet (the 
3rd Law of Motion!). He therefore argued that the expression for the force between the planet and the 
Sun must contain, explicitly, the masses of the planet and the Sun. The gravitational force Fg between 


them therefore has the form 
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Fg = GMsunMpbne/R 2 , (7.10) 

where G is a constant. 

Newton saw no reason to limit this form to the Sun-planet system, and therefore he 
announced that for any two spherical masses, Mi and M 2 , the gravitational force between them is 
given by 

Fg=GMiM^R 2 , (7.11) 

where G is a universal constant of Nature. 

All evidence points to the fact that the gravitational force between two masses is always 
attractive. 

Returning to the Earth-Moon system, the force on the Moon (mass Mm) in orbit is 

Fr = GMeMm/R 2 = Mm3r (7.12) 

so that 

aR = GMe/R 2 , which is independent of Mm. (The cancellation of the mass Mm in 
the expressions for Fr involves an important point that is discussed later in the section 8 . 1 ). 

At the surface of the Earth, the acceleration, g, of a mass M is essentially constant. It does not depend 
on the value of the mass, M, thus 

g = GMe/Re 2 , where Re is the radius of the Earth. (7.13) 

(It took Newton many years to prove that the entire mass of the Earth, Me, is equivalent to a point mass, 
Me, located at the center of the Earth when calculating the Earth’s gravitational interaction with a mass 
on its surface. This result depends on the exact 1 /Feature of the force). 



Tine ratio of the accelerations, aVg, is therefore 
a^g = (GMe/R 2 )/(GMe/Re 2 ) = (Re/R) 2 
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(7.14) 

Newton knew from observations that the ratio of the radius of the Earth to the radius of the Moon’s orbit 
is about 1/60, and therefore he obtained 
ap/g* (1/60) 2 =1/3600. 

so that 

a R = g/3600 = (32/3600)ft/setf = 0.007...fi/sec 2 . 

In one of the great understatements of analysis, Newton said, in comparing this result with the value for 
a R that he had deduced using a R = v 2 /R . . .’’that it agreed pretty nearly” . . .The discrepancy came 
largely from the errors in the observed ratio of the radii. 

7.3 Gravitation: an example of a central force 

Central forces, in which a particle moves under the influence of a force that acts on the particle 
in such a way that it is always directed towards a single point — the center of force — form an 
important class of problems . Let the center of force be chosen as the origin of coordinates: 



0 
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Tine description of partide motion in terms of polar coordinates (Chapter2), is well-suited to the analysis 
of the central force problem. For general motion, the acceleration of a point P[r, ty] moving in the plane 
has the following components in the r- and “c|>”- directions 

ar = Ur(d 2 r/dt 2 — r(d(p/dt) 2 ), (7.15) 

and 

= u^d^dt 2 ) + 2(dr/dt)(d4>/dt)), (7.16) 

where u r and u (|) are unit vectors in the r- and Infections. 

In the central force problem, the force F is always directed towards 0, and therefore the 
component a q „ perpendicular to r, is always zero: 

a ([) = u (| ,(r(d2(|Vdt 2 ) + 2(dr/dt)(d(j)/dt) = 0, (7.17) 

and therefore 

r(cPc|)/dt 2 ) + 2(dr/dt)(dc|)/dt) = 0. (7.18) 

This is the equation of motion of a particle moving under the influence of a central force, centered at 0. 

If we take the Sun as the (fixed) center offeree, the motion of a planet moving about the Sun is 
given by this equation. Tie differential equation can be solved by making the substitution 

co = d(p/dt, (7.19) 

giving 

rdco/dt + 2co(dr/dt) = 0, (7.20) 

or 


rdo)=-2codr. 



131 


Separating the variables, we obtain 
d(o/oa=-2dr/r. 

Integrating, gives 

logeoo = -2loger + C (constant), 

therefore 

log e (oor 2 ) = C. 

Taking antilogs gives 

fto = r^ckjVdt) = e c = k, a constant. (7.21 ) 

7.4 Motion under a central force and the conservation of angular momentum 

The above solution of the equation of motion of a particle of mass m, moving under the 
influence of a central force at the origin, 0, can be multiplied throughout by the mass m to give 

mr 2 (d(|)/dt) = mk (7.22) 

or 

mr(r(dcj)/dt)) = K, a constant for a given mass, (7.23) 

We note that r(dcp/dt) = v (|1 , the component of velocity perpendicular to r, therefore 

angular momentum of m about 0 = r(mv (|) ) = K, a constant of the motion fora 

central force. 

7.5 Kepler’s 2nd law explained 

The equation r 2 (dc|)/dt) = constant K, can be interpreted in terms of an element 
of area swept out by the radius vector r, as follows 



132 



From the diagram, we see that the following inequality holds 
r 2 Acj)/2 < AA < (r + ArfA$l2 


or 

r 2 /2<AA/A(|)<(r+Ar) 2 /2. 

When Acj) — > 0, r + Ar — > r, so that, in the limit 
dA/dcj) = r 2 /2. 

The element of area is therefore 
dA = r2d(|>/2. 

Twice the time rate of change of this element is therefore 

2dA/dt=r 2 (d(|)/dt). (7.24) 

We recognize that this expression is equal to k, the constant that occurs in the solution of the 
differential equation of motion for a central path. The radius vector r therefore sweeps out area at a 
constant rate. This is Kepler’s 2nd Law of Planetary Motion; it is seen to be a direct consequence of the 
fact that the gravitational attraction between the Sun and a planet is a central force problem. 



133 


7.6 Central orbits 

A central orbit must be a plane curve (there is no force out of the plane), and the moment of 
the velocity r 2 (d(|)/dt), about the center of force, must be a constant of the motion. Tbe moment can be 
written in three equivalent ways: 



Tbe moment of the velocity about 0 is then 

r(r(dcf>/dt) = pv = x(dy/dt)-y(dx/dt) 

= a constant, h, say. (7.25) 

Tbe result r 2 (d(|)/dt) = constant for a central force can be derived in the following alternative way: 
Tbe time derivative of r 2 (dcjVdt) is 

(d/dt)(r 2 (dc|)/dt)) = r 2 (dW) + (d<fVdt)2r(dr/dt) (7.26) 

If this equation is divided throughout by r then 

(1/r)(d/dt)(r 2 (d(|)/dt)) = r(d W) + 2(dr/dt)(dc|Vdt) (7.27) 

= the transverse acceleration 

= 0 for a central force. (7.28) 

Integrating then gives 


r 2 (d(|)/dt) = constant for a central force. 


(7.29) 
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7.6.1 The law of force in [p, r] coordinates 

There are advantages to be gained in using a new set of coordinates— [p, r] coordinates — in 
which a point P in the plane is defined in terms of the radial distance r from the origin, and the 
perpendicular distance p from the origin onto the tangent to the path at P. (See following diagram). 

Let a particle of unit mass move along a path under the influence of a central force directed 
towards a fixed point, 0. Let ac be the central acceleration of the unit mass at P, let the perpendicular 
distance from 0 to the tangent at P be p, and let the instantaneous radius of curvature of the path at the 
point P be p: 



The component of the central acceleration along the inward normal at P is 

a ± =adsina=v 2 /p = ac(p/r). (7.30) 

The instantaneous radius of curvature is given by 

p = r(dr/dp). (7.31) 

For all central forces, 


pv= constant =h : 


(7.32) 
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therefore 

a ± = v 2 /p = (h 2 /p 2 )( 1 /r)(dp/dr) = a(p/r), (7.33) 

so that 


ac=(h 2 /p 3 )(dp/dr). (7.34) 

This differential equation is the law of force per unit mass given the orbit in [p, r] coordinates. 

(It is left as a problem to show that given the orbit in polar coordinates, the law of force perunit mass is 
dc = h 2 u2{u + d 2 u/d(|)2}, where u = 1/r ). (7.35) 

In order to find the law offeree per unit mass (acceleration), given the [p, r] equation of the orbit, 
it is necessary to calculate dp/dr. For example, if the orbit is parabolic, the [p, r] equation can be 
obtained as follows 



(7.36) 

(7.37) 


The triangles FAQ and FQP are similar, therefore 


p/a = r/p,whereAF = a, 


giving 


1/p 2 = 1/ar, the p-r equation of a parabola. 


Differentiating this equation, we obtain 
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(1/p 3 )dp/dr= 1/2ar 2 . (7.38) 

Tine law of acceleration for the parabolic central orbit is therefore 

ac = (h 2 /p 3 )dp/dr = (h 2 /2a)(1/r 2 ) = constant/r 2 . (7.39) 

Tine instantaneous speed of P is given by the equation v = h/p; we therefore find 

v=h/Var. (7.40) 

This approach can be taken in discussing central orbits with elliptic and hyperbolic forms. 

Consider the ellipse 



The foci are Fi and F 2 , the semi-major axis is a, the semi-minor axis is b, the radius vectors to the point 
P [r, c|>] are ri and r 2 , and the perpendiculars from Fi and F 2 onto the tangent at P are pi and jpz. 

Using standard results from analytic geometry, we have for the ellipse 

1) n + r 2 = 2a, (7.41 a-c) 

2) pip 2 = b 2 , and 

3) angle QPFi = angle RPF 2 . 

The triangles FiQP and F 2 RP are similar, and therefore 


Pi/ri = P 2 A 2 


(7.42) 
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or 

(piP 2 /nr 2 ) ie = b/{n(2a - n)} 1,2 = pi/n 

so that 

b2/pi 2 = 2a/n-1. (7.43) 

This is the [p, r] equation of an ellipse. 

The [p, r] equation for the hyperbola can be obtained using a similar analysis. The standard results 
from analytical geometry that apply in this case are 

1) pip 2 = b 2 , (7.44 a-c) 

2) r 2 — n =2a, and 

3) the tangent at P bisects the angle between the focal distances. 

(b 2 = a 2 (e2 - 1 ) where e is the eccentricity (e 2 > 1), and 2b2/a is the latus rectum). 

We therefore obtain 

bi 2 /pi 2 = 2a/n + 1. (7.45) 

This is the [p, r] equation of an hyperbola. 

7.7 Bound and unbound orbits 

For a central force, we have the equation for the acceleration in [p, r] form 
(h 2 /p 3 )dp/dr=a c . (7.46) 

If the acceleration varies as 1/r 2 , then the form of the orbit is given by separating the variables, and 


integrating, thus 

h 2 Jdp/p 3 = kjdr/r 2 , 


(7.47) 
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so that 

— h 2 /2p2 = — k/r, where k is a constant, or 

If/p 2 = 2k/r + C, where the value of C depends on the form of the orbit. 

Comparing this form with the general form of the [p, r] equations of conic sections, we see that the orbit 
is an ellipse, parabola, or hyperbola depending on the value of C. If 
C is negative, the orbit is an ellipse, 

C is zero, the orbit is a parabola, 

and if 

C is positive, the orbit is an hyperbola. 

The speed of the particle in a central orbit is given by v = h/p. If, therefore, the particle is 
projected from the origin, 0 (corresponding to r = ro) with a speed vo, then 

If/p 2 = Vo 2 = 2k/r 0 + C, (7.48) 

so that the orbit is 

1 ) an ellipse ifvo 2 < 2k/ro, (7.49 a-c) 

2) a parabola if Vo 2 = 2k/ro, 
or 

3) an hyperbola if Vo 2 > 2k/ro. 

The escape velocity, the initial velocity required for the particle to go into an unbound orbit is 


therefore given by 
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v 2 escape = 2k/ro = 2 GMe/Re, for a partide launched from the surface of the Earth. This 
condition is, in fact, an energy equation 

(1/2)(m = 1)v 2 escape = GM E (m = 1)/Re . (7.50) 

^ * 

kinetic energy potential energy 

7. 8 The concept of the gravitational field 

Newton was well-aware of the great difficulties that arise in any theory of the gravitational 
interaction between two masses not in direct contact with each other. In th ePrincipia, he assumes, in 
the absence of any experimental knowledge of the speed of propagation of the gravitational interaction, 
that the interaction takes place instantaneously. However, in letters to other luminaries of his day, he 
postulated an intervening agent between two approaching masses — an agent that requires a finite 
time to react. In the early 17th century, the problem of understanding the interaction between spatially 
separated objects appeared in a new guise, this time in discussions of the electromagnetic interaction 
between charged objects. Faraday introduced the idea of a Held offeree with dynamical properties. In 
the Faraday model, an accelerating electric charge acts as the source of a dynamical electromagnetic 
field that travels at a finite speed through spacetime, and interacts with a distant charge. Energy and 
momentum are thereby transferred from one charged object to another distant charged object. 

Maxwell developed Faraday’s idea into a mathematical theory— the electromagnetic theory of light — 
in which the speed of propagation of light appears as a fundamental constant of Nature. His theory 
involves the differential equations of motion of the electric and magnetic field vectors; the equations are 
not invariant under the Galilean transformation but they are invariant under the Lorentz transformation. 
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(The discovery of the transformation that leaves Maxwell’s equations invariant for all inertial observers 
was made by Lorentz in 1 897). We have previously discussed the development of the Special Theory 
of Relativity by Einstein, a theory in which there is but one universal constant, c, hr the speed of 
propagation of a dynamical field in a vacuum. This means that c is not only the speed of light in free 
space but also the speed of the gravitational field in the void between interacting masses. 

We can gain some insight into the dynamical properties associated with the interaction 
between distant masses by investigating the effect of a finite speed of propagation, c, of the gravitational 
interaction on Newton’s Laws of Motion. Consider a non-orbiting mass M, at a distance R from a mass 
mass Ms, simply falling from rest with an acceleration a(R) towards Ms. According to Newton’s Theory 
of Gravitation, the magnitude of the force on the mass M is 

|F(R)| = GMsM/R 2 = Ma(R), (7.51) 

We therefore have 

a(R) = GMs/R 2 . (7.52) 

Let At be the time that it takes for the gravitational interaction to travel the distance R at the universal 
speed c, so that 

At=R/c. (7.53) 

In the time interval At the mass M moves a distance, AR, towards the mass IVL; 

AR = aAT/2 


= (GMs/R 2 )At 2 /2 
= (GM^R 2 )(R/c)2/2. 


(7.54) 
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Consider the situation in which the mass M is in a circular orbit of radius R about the mass, Ms. 
Let v(t) be the velocity of the mass M at time t, and v(t + At) its velocity at t + At, where At is chosen to 
be the interaction travel time. Let us consider the motion of M if there were no mass Ms present, and 
therefore no interaction; the mass M then would continue its motion with constant velocity v(t) in a 
straight line. We are interested in the difference in the positions of M at time t + At with and without the 
mass Ms in place. We have, to a good approximation: 

M v(t) 

• <- “extrapolated position” (no mass Ms) 

R 

v(t+At) 

The magnitude of the gravitational force, Fex, at the extrapolated position, with Ms in place, is 


Fex = GM s M/(R + AR)2 (7.55) 

= (GM S M/R 2 )(1 + ARRY 2 

* (GMsM/R 2 )(1 -2AR/R), for AR « R. (7.56) 

Substituting the value of AR obtained above, we find 

Fex « GMsM/R 2 - (GM s M/Rc 2 )(GMs/R 2 ). (7.57) 

Nerwton’s 3rd Law states that 

Fms,m=-Fm,ms (7.58) 
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This Law is true, however, for contact interactions only. For all interactions that take place between 
separated objects, there is a mis-match between the action and the reaction. It takes time for one 
particle to respond to the presence of the other! 

In the present example, we obtain a good estimate of the mismatch by taking the difference between 
Fex(R + AR) and F(R), namely 

Fex(R + AR) - F(R) « (GM s M/Rc 2 )(GMs/R 2 ). (7.59) 

On the right-hand side of this equation, we note that the term (GMs/R 2 ) has dimensions of 
“acceleration”, and therefore the term (GMsIWRc 2 ) must have dimensions of “mass”. We see that this 
term is an estimate of the “mass” associated with the interaction, itself. The space between the 
interacting masses must be endowed with this effective mass if Newton’s 3rd Law is to include non- 
contact interactions, fine appearance of the term c? in the denominator of this effective mass term has 
a special significance. If we invoke Einstein’s famous relation E = Me?, then AE = AMc 2 so that the 
effective mass of the gravitational interaction can be written as an effective energy: 

AEgrav = GMsM/R. (7.60) 

This is the “energy stored in the gravitational field” befween the two interacting masses. Note that it has 
a 1/R-dependence — the correct form for the potential energy associated with a 1/R 2 gravitational 
force. We see that the notion of a dynamical field of force is a necessary consequence of the finite 
propagation time of the interaction. 
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7.9 The gravitational potential 

The concept of a gravitational potential has its origins in the work of Leibniz. The potential 
energy, V(x), asssociated with n interacting particles, of masses mi, m 2 , ...m n , situated atxi, X2, ...Xn, is 
related to the gravitational force on a mass M atx, due to the n particles, by the equation 

F(x)=-W(x). (7.61) 

The exact forms of F(x) and V(x) are 

F(x) = — GM^p=i,n] mi(x— Xi)/|x— Xi| 3 , (7.62) 

and 

V(x) =-GMXp=i,n] m/|x-x| . 

In upper-index notation, the components of the force are 

F k (x)=-3V/5x k ,k = 1,2, 3. (7.63) 

The gravitational field, g(x), is the force per unit mass: 

g(x) = F(x)/M, (7.64) 

and the gravitational potential is defined as 

0(x) = V(x)/M = -Xo=i.n] Gm/|x-x|. (7.65) 

The sign of the potential is chosen to be negative because the gravitational force is alviays attractive. 
(This convention agrees with that used in Electrostatics). 

If the mass consists of a continuous distribution that can be described by a mass density p(x), 
then the potential is 


0(x) =- j(Q3(x')/|x-x'|) d¥. 


(7.66) 
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It is left as an exercise to show that this form of O means that the potential obeys Poisson’s equation 
V 2 O(x)-4'nGp(x) = 0. 

We should note that the gravitational potential of a mass M has the form 

V(r)=-GIWr (7.67) 

only around a mass distribution with spherical symmetry. For an arbitrary mass distribution, the 
potential can be written as a series of multipoles. 

fine potential of a circular disc at a point on its axis can be found asfollows 



Let the disc be divided into concentric circles. The potential at P, on the axis, due to the elemental ring 
of radius r and width dr is 2irrdrGa/PQ, where a is the mass per unit area of the disc. The potential at 
P of the entire disc is therefore 

V P =jp,a]2TiGordr/PQ, (7.68) 

where a is the radius of the disc. Therefore, 


V P = 2TiGcrJ[o,a] rdr/(r 2 + p 2 ) 12 
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=2TiGo[(r 2 +p 2 ) 1c ]p, a] 

= 2tcGo(R — p), (7.69) 

where R is the distance of P from any point on the circumference. 

PROBLEMS 

7-1 Show that the gravitational potential of a thin spherical shell of radius R and mass M at 
a point Pis 

1 ) GM/d where d is the distance from P to the center of the shell if d >R, and 

2) GM/R if P is inside or on the shell. 

7-2 If d is the distance from the center of a solid sphere (radius R and density p) to a point 
P inside the sphere, show that the gravitational potential at P is 
Op=2TiGp(R 2 -d 2 /3). 

7-3 Show that the gravitational attraction of a circular disc of radius R and mass per unit 
area a, at a point P distant p from the center of the disc, and on the axis, is 
2 ttGoMP 2 + R 2 H-1}. 

74 A particle moves in an ellipse abouta center offeree at a focus. Provethatthe 
instantaneous velocity v of the particle at any point in its orbit can be resolved into 
two components, each of constant magnitude: 1) of magnitude ah/tf, perpendicular to 
the radius vector rat the point, and 2) of magnitude ahe/b? perpendicular to the major 
axis of the ellipse. Here, a and b are the semi-major and semi-minor axes, e is the 
eccentricity, and h = pv = constant for a central orbit. 
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7.5 A particle moves in an orbit under a central acceleration a = k/r 1 2 where k = constant. 
If the particle is projected with an initial velocity vo in a direction at right angles to 
the radius vector r when at a distance ro from the center offeree (the origin ), prove 
(dr/dt) 2 ={(2k/ro)-vo 2 (1 + WM-1}. 

This problem involves the energy and momentum equations in r^ coordinates. 

7-6 A particle moves in a cardioidal orbit, r = a(1 + cosej)), under a central force 



1 ) show that the p-r equation of the cardioid is p 2 = r 3 /2a, and 

2) show that the central acceleration is 3alf /r 4 , where h = pv= constant. 

7-7 A planet moves in a circular orbit of radius r aboutthe Sun as focus at the center. 

If the gravitational “constant” G changes slowly with time— G(t), then show that the 
angular velocity, go, of the planet and the radius of the orbit change in time according 
to the equations 

(1/co)(dco/dt) = (2/G)(dG/dt) and (1/r)(dr/dt) = (-1/G)(dG/dt). 

7-8 A particle moves under a central acceleration a = k(W) where k is a constant. 

If k = h 2 , where h = r^dep/dt) = pv, then show that the path is 

1/r = Acp + B, a “reciprocal spiral”, where A and B are constants. 
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8 

EINSTEINIAN GRAVITATION: 

AN INTRODUCTION TO GENERAL RELATIVITY 

8.1 The principle of equivalence 

The term “mass” that appears in Newton’s equation for the gravitational force between two 
interacting masses refers to “gravitational mass” — that property of matter that responds to the 
gravitational force . . .Newton’s Law should indicate this property of matter: 

Fg = GM G m G /r 2 , where M G and m G are the gravitational masses of the interacting 
objects, separated by a distance r. 

The term “mass” that appears in Newton’s equation of motion, F = ma, refers to the “inertial 
mass” — that property of matter that resists changes in its state of motion. Newton’s equation of 
motion should indicate this property of matter: 

F(r) = m'a(r), where m 1 is the inertial mass of the particle moving with an 
acceleration a(r) in the gravitational field of the mass M G . 

Newton showed by experiment that the inertial mass of an object is equal to its gravitational 
mass, m 1 = m G to an accuracy of 1 part in 1 0 3 . Recent experiments have shown this equality to be true 
to an accuracy of 1 part in 10 12 . Newton therefore took the equations 

F(r) = GM G m G /r 2 = m'a(r), (8.1) 

and used the condition m G = m 1 to obtain 


a(r) = GM G /r 2 . 


( 8 . 2 ) 
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Galileo had, of course, previously shown that objects made from different materials fall with the 
same acceleration in the gravitational field at the surface of the Earth, a result that implies nfr oc m 1 . 
This is the Newtonian Principle of Equivalence. 

Einstein used this Principle as a basis for a new Theory of Gravitation! He extended tie axioms of 
Special Relativity, that apply to field-free frames, to frames of reference in ‘free fall”. A freely tailing 
frame must be in a state of unpowered motion in a uniform gravitational field. Tie field region must be 
sufficiently small for there to be no measurable variation in the field throughout the region. If a field 
gradient does exist in the region then so called “tidal effects” are present, and these can, in principle, be 
determined (by distorting a liquid drop, for example). The results of all experiments carried out in ideal 
freely falling frames are therefore fully consistent with Special Relativity. All freely-falling observers 
measure the speed of light to be c, its constant free-space value. It is not possible to carry out 
experiments in ideal freely-falling frames that permit a distinction to be made between the acceleration 
of local, freely-falling objects, and their motion in an equivalent external gravitational field. As an 
immediate consequence of the extended Principle of Equivalence, Einstein showed that a beam of 
light would be deflected from its straight path in a close encounter with a sufficiently massive object. 
The observers would, themselves, be far removed from the gravitational field of the massive object 
causing the deflection. Einstein’s original calculation of the deflection of light from a distant star, grazing 
the Sun, as observed here on the Earth, included only those changes in time intervals that he had 
predicted would occur in the near field of the Sun. His result turned out to be in error by exactly a factor 
of two. He later obtained the “correct’ value for the deflection by including in the calculation the changes 
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in spatial intervals caused by the gravitational field. A plausible argument is given in the section 8.6 for 
introducing a non-intuitive concept the refractive index of spacetime due to a gravitational field. This 
concept is, perhaps, the characteristic physical feature of Einstein’s revolutionary General Theory of 
Relativity. 

8.2 Time and length changes in a gravitational field 

We have previously discussed the changes that occur in the measurement of length and time 
intervals in different inertial frames. These changes have their origin in the invariant speed of light and 
the necessary synchronization of clocks in a given inertial frame. Einstein showed that measurements 
of length and time intervals in a given gravitational potential are changed relative to the measurements 
made in a different gravitational potential. These fielcktependent changes are not to be confused with 
the Special-Relativistic changes discussed in 3.5. Although an exact treatment of this topic requires the 
solution of the full Einstein gravitational field equations, we can obtain some of the key results of the 
theory by making approximations that are valid in the case of our solar system. These approximations 
are treated in the following sections. 

8.3 The Schwarzschild line element 

An observer in an ideal freely-felling frame measures an invariant infinitesimal interval of the 
standard Special Relativistic form 

ds 2 = (cdt) 2 - (dx 2 + dy 2 + dz 2 ). (8.3) 

It is advantageous to transform this form to spherical polar coordinates, using the linear equations 
x = rsinGcoscj), y = rsinGsincp, and z = rcosG. 
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We then have 

z 

z 


Tine square of the length of the diagonal of the infinitesimal cube is seen to be 

dl 2 = dr 2 + (rde) 2 + (rsinGdcJ)) 2 . (8.4) 

The invariant interval can therefore be written 

ds 2 = (cdt f - dr 2 - r 2 (d02 + sin^). (8.5) 

The key question that now feces us is this: how do we introduce gravitation into the problem? We can 
solve the problem by introducing an energy equation into the argument. 

Consider two observers 0 and O', passing by one another in a state of free fall in a 
gravitational field due to a mass M, fixed at the origin of coordinates. Both observers measure a 
standard interval of spacetime, ds according to 0, and ds' according to O', so that 

ds 2 = ds' 2 = (cdt*) 2 - dr' 2 - r' 2 (d0' 2 + sin 2 0'dc|)' 2 ) (8.6) 



Tine situation is as shown 
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Let the observer O' just begin free fell towards M at the radial distance r, and let the observer 0, close 
to O', be freely falling away from the mass M. The observer 0 is in a state of unpowered motion with 
just the right amount of kinetic energy to “escape to infinity”. Since both observers are in states of free 
fall, we can, according to Einstein, treat them as if they were ‘inertial observers”, finis means that they 
can relate their local space-time measurements by a Lorentz transformation. In particular, they can 
relate their measurements of the squared intervals, ds 2 and ds' 2 , in the standard way. Since their 
relative motion is along the radial direction, r, time intervals and radial distances will be measured to be 
changed: 

At = yAt' and yAr = Ar', (8.7 a,b) 

where 

y = 1/{1 - (v/c) 2 } 1 ' 2 , in which v = vo(r) because vo’(r) * 0. 
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If 0 has just enough kinetic energy to escape to infinity, then we can equate the kinetic energy 
to the potential energy, so that 

vo 2 (r)/2 = 1 -O(r) if the observer 0 has unit mass. (8.8) 

<E>(r) is the gravitational potential at r due to the presence of the mass, M, at the origin. 

This procedure enables us to introduce the gravitational potential into the value ofy in the Lorentz 
transformation. We have vo 2 = 20(r) = v 2 , and therefore 

At=At7{1-20(r)/c 2 } 1/2 , (8.9) 

and 

Ar=Ar'{1-20(r)/tf} 1fi . (8.10) 

Only lengths parallel to r change, therefore 

r2(d0 2 + sin 2 0d(|> 2 ) = r' 2 (d0' 2 + sin0 'd(j>' 2 ), (8. 1 1 ) 

and therefore we obtain 


ds 2 = ds 2 - c 2 (1 -20(r)/c 2 )dt 2 -dr 2 /(1 -2O(r)/c 2 )-r 2 (d0 2 +sin 2 Od(|) 2 ). (8.12) 

If the potential is due to a mass M at the origin then 

O(r) = GM/r, (r > R, the radius of the mass, M) 

therefore, 


ds 2 = c 2 (1 -2GM/rc2)dt 2 -(1 - 2GM/rc 2 )- 1 dr 2 - r 2 (d0 2 + sinTJdcJ) 2 ). 

(8.13) 

This is the famous Schwarzschild line element, originally obtained as an exact solution of the Einstein 
field equations. The present approach fortuitously gives the exact result! 
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8.4 The metric in the presence of matter 

In the absence of matter, the invariant interval of space-time is 

ds 2 = ii, lv dx ! 'dx v (p, v = 0, 1 , 2, 3), (8.14) 

where 

t|mv = diag(1 , — 1 , — 1 , — 1) (8.15) 

is the metric of Special Relativity; it “lowers the indices” 

( 8 - 16 ) 

The form of the Schwarzschild line element ds?sch, shows that the metric g„ v in the presence of 
matter differs from r| [A , We have 

ds 2 sdi=g (W dx M dx v , (8.17) 

where 

dx° = cdt, dx 1 = dr, dx 2 = rd0, and dx 3 = rsinOdcj), 
and 

g, v =diag((1-x),-(1-x)-i,-(1-x)-i,-(1- x y-i) 

in which 

X=2GIWrc 2 . 

The Schwarzschild metric lowers the indices 

dx,, = g llv dx v , (8.18) 

so that 


ds 2 K t 1 =dx"dx ll . 


(8.19) 



154 


8.5 The weak field approximation 

If X = 2GM/rc 2 « 1, the coefficient, (1 - y}~\ of dr 2 in the Schwarzschild line element can be 
replaced by the leading term of its binomial expansion, (1 +x -■) to give the “weak field” line element: 
ds 2 w=(1-x)(cdt) 2 -(1 + x )dr 2 -r 2 (d0 2 +sin 2 0d(l) 2 ). (8.20) 

At the surface of the Sun, the value of x is 4.2 x 8A so that the weak field approximation is 
valid in all gravitational phenomena in our solar system. 

Consider a beam of light traveling radially in the weak field of a mass M, then 

ds^ = 0 (a light-like interval) , and d0 2 + sin^ldcj) 2 =0, (8.21 ) 

giving 

0 = (1 — x)(cdt) 2 — (1 +x)dr 2 . (8.22) 

The “velocity” of the light vl = dr/dt, as determined by observers far from the gravitational influence of M, 
is therefore 

Vl = c{(1 - x)/(1 + x)} 12 ^ c if x ^ 0. (8.23) 

(Observers in free fall near M always measure the speed of light to be c). 

Expanding the term {(1 - x)/(1 + x)} 1c to first order in x, we obtain 
v L (r)/c*(1-x/2 ...)(1-x/2 ...) 

= (1-x4 (8.24) 

Therefore 

v L (r)~c(1-2GIWrc 2 ...), (8.25) 

so that VL(r) < c in the presence of a mass M according to observers far removed from M. 
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8.6 The refractive index of space-time in the presence of mass 

In Geometrical Optics, the refractive index, n, of a material is defined as 

n = C/Vmedium (8.26) 

where Vmedun is the speed of light in the medium. We introduce the concept of the refractive index of 
space-time, no(r), at a point r in the gravitational field of a mass, M: 
nG = c/v L (r) 

* 1/(1 - x ) 

= 1 +/ to first-order in x- 

= 1 + 2GM/rc 2 . (8.27) 

The value of nG increases as r decreases . This effect can be interpreted as an increase in the “density” 
of space-time as M is approached. 

8.7 The deflection of light grazing the sun 

As a plane wave of light approaches a spherical mass, those parts of the wave front nearest 
the mass are slowed down more than those parts farthest from the mass. The speed of the wave front 
is no longer constant along its surface, and therefore the normal to the surface must be deflected: 
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Tine deflection of a plane wave of light by a spherical mass, M, as it travels through space-time can be 
calculated in the weak field approximation. We choose coordinates as shown 



We have shown that the speed of light (moving radially) in a gravitational field, measured by an 
observer far from the source of the field, depends on the distance, r, from the source 

v(r) = c(1 — 2GM/rc 2 ) (8.28) 

where c is the invariant speed of light as r ^ 00 . 

We wish to compare dx with dx', the distances travelled in the x-direction by the wavefront at y and y + 
dy, in the interval dt. 

We have 


r 2 = (y+R)2+x 2 


(8.29) 
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therefore v(r) -> v(x, y) so that 
2r(3r/dy) = 2(y + R), 
and 

3r/3y = (y + R )/r . (8.30) 

Very dose to the surface of the mass M (radius R), the gradient is 

dr/dy|y^o R/r. (8.31) 

Now, 

dv(r)/<9y - (d/3r)(c(1 - 2GM/rc 2 ))(«9r/<9y) 

= (2GM/r 2 c)(3r/5y). (8.32) 

We therefore obtain 

5v(r)/5y| y _o = (2GM/r 2 c)(R/r) = 2GMR/r3c. (8.33) 

Let the speed of the wavefront be v' at y + dy and v at y. The distances moved in the interval dt are 
therefore 

dx' = v'dt and dx = vdt. (8.34 a,b) 

The first-order Taylor expansion ofv' is 
v'=v+(3v/3y)dy, 

and therefore 

dx' - dx = (v + (3v/3y)dy)dt- vdt = (3v/3y)dydt. (8.35) 

Let the corresponding angle of deflection of the normal to the wavefront be da, then 


da = (dx'-dx)/dy 
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= (dv/dy)dt = (dv/dy)(dx/v). (8.36) 

Tine total deflection of the normal to the plane wavefront is therefore 

Aa = jp»o,=o](5v/5y)(dxA/) (8.37) 

~ (1/c)jpx> M] (<9v/3y)dx . 

(v=c over most of the range of the integral). 

flie portion of the wavefront that grazes the surface of the mass M (y-» 0) therefore undergoes a total 
deflection 

Aa « (1/c)|p»,oo](2GMR/rfo)dx (8.38) 

= 2GMR/c 2 jpo,oo ] dx/(R 2 +X 2 ) 32 
=2GMR/c 2 [x/(R 2 (R 2 +x 2 ) 1 ^” 

= 2(GMR/c 2 )(2/R 2 ). 

so that 

Aa = 4GM/Rc 2 . 

This is Einstein’s famous prediction; putting in the known values for G, M, R, and c, gives 

Aa = 1 .75 arcseconds. (8.39) 

Measurements of this very small effect, made during total eclipses of the Sun at various times and 
places since 1919, are folly consistent with Einstein’s prediction. 

PROBLEMS 

8-1 If a particle A is launched with a veloaty vqa from a point P on the surface of the 
Earth at the same instant that a particle B is dropped from a point Q, use the Principle 
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of Equivalence to show that if A and B are to collide then vqa must be directed along PQ. 


Q g{[ 



8-2 A satellite is in a circular orbit above the Earth. It carries a clock that is similar to a 
clock on the Earth. There are two effects that must be taken into account in 
comparing the rates of the two clocks. 1 ) the time shift due to their relative speeds 
(Special Relativity), and 2) the time shiftdue to their different gravitational potentials 
(General Relativity). Calculate the SR shift to secondorder in (v/c), where v is the 
orbital speed , and the GR shift to the same order. In calculating the difference in the 
potentials , integrate from the surface of the Earth to the orbit radius. Tbe two 
effects differ in sign. Show that the total relative change in the frequency of the 
satellite clock compared with the Earth clock is 

(Av/ve) ~ (gRE/c 2 X1 - (3RE/2rs)}, where rs is the radius of the 
satellite orbit (measured from the center of the Earth). We see that if the altitude of 
the satellite is > Re/2 (~ 3200 km) Av is positive since the gravitational effect then 
predominates, whereas at altitudes less than -3200 km, the Special Relativity effect 
predominates. At an altitude - 3200 km, the clocks remain in synchronism. 
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9 

AN INTRODUCTION TO THE CALCULUS OF VARIATIONS 

9.1 The Euler equation 

A frequent problem in Differential Calculus is to find the stationary values (maxima and 
minima) of a function y(x). The necessary condition for a stationary value at x = a is 
dy/dx| x =a =0. 

Fora minimum, 

d 2 y/dx 2 | x =a > 0, 
and for a maximum, 

d 2 y/dx 2 | x =a < 0. 

The Calculus of Variations is concerned with a related problem, namely that of finding a 
function y(x) such that a definite integral taken over a function of this function shall be a maximum or a 
minimum. This is clearly a more complicated problem than that of simply finding the stationary values 
of a function, y(x). 

Explicitly, we wish to find that function y(x) that will cause the definite integral 

J[xi,x2] F(x, y, dy(x)/dx)dx (9.1) 

to have a stationary value. 

The integrand F is a function of y(x) as well as of x and dy(x)/dx. The limits xi and X 2 are assumed to be 
fixed , as are the values y(xi) and y(x 2 ). The integral has different values along different “paths” that 
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connect (xi , yi) and (X2, y^. Let a path be Y(x), and let this be one of a set of paths that are adjacent to 
y(x). We take Y(x) - y(x) to be an infinitesimal for every value of x in the range of integration. 

Let the difference be defined as 

Y(x) - y(x) = 6y(x) (a “first-order change”), (9.2) 

and 


F(x, Y(x), dY(x)/dx)- F(x, y(x), dy(x)/dx) = 6F. (9.3) 

The symbol 6 is called a variation ; it represents the change in the quantity to which it is applied 
as we go from y(x) to Y(x) at the same value ofx. Note Sx = 0, and 

6(dy/dx) = dY(x)/dx- dy(x)/dx = (d/dx)(Y(x) - y(x)) = (d/dx)(6y(x)). 

The symbols 6 and (d/dx) commute: 

5(d/dx)-(d/dx)8=0. (9.4) 


Graphically, we have 



0 Xl X 2 x 

Using the definition of 6F, we find 



Y(x) (d/dx)Y(x) 


— F(x, y, dy/dx) 


(9.5) 
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= (3F/3y)5y + (3F/3y')Sy' for fixed x. (Here, dy/dx = y'). 

The integral 

J[xi,x2] F(x, y, y')dx, (9.6) 

is stationary if its value along the path y is the same as its value along the varied path, y + Sy = Y. We 
therefore require 

J[xi,x2] 6 F(x, y, y')dx = 0 . (9.7) 

This integral can be written 

Jsd,)2|{(3F/3y)5y + (5F/5y')6y'}dx = 0. (9.8) 

The second term in this integral can be evaluated by parts, giving 

PF/ayMt (d/dxXaF/ay'Jdydx. (9.9) 

But 8 yi = 6 y 2 = 0 at the end-points xi and X 2 , therefore the term [ = 0, so that the stationary 

condition becomes 

J[xi,x 2 ] {3F/3y- (d/dx)3F/3y}6ydx = 0. (9.10) 

The infinitesimal quantity 6 y is positive and arbitrary, therefore, the integrand is zero: 

3F/3y- (d/dx)3F/3y' = 0. (9.11) 

This is known as Euleris equation. 

9.2 The Lagrange equations 

Lagrange, one of the greatest mathematicians of the 18th century, developed Euler's equation 
in order to treat the problem of particle dynamics within the framework of generalized coordinates. Fie 


made the transformation 
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F(x, y, dy/dx) -» L(t, u, du/dt) (9.12) 

where u is a generalized coordinate and du/dt is a generalized velocity. 

The Euler equation then becomes the Lagrange equation-of-motion: 

• • 

dlldu - (d/dt)(3L/du) = 0, where u is the generalized velocity. (9.13) 

The Lagrangian L(t; u, u) is defined in terms of the kinetic and potential energy of a particle, or system of 
particles: 

UT-V. (9.14) 

It is instructive to consider the Newtonian problem of the motion of a mass m, moving in the 
plane, under the influence of an inverse-square-law force of attraction using Lagrange’s equationsof- 
motion. Let the center offeree be at the origin of polar coordinates. The kinetic energy of m at [r, (()] is 


T=m((dr/dt) 2 + r 2 (d(|)/dt) 2 )/2, (9.15) 

and its potential energy is 

V = — k/r, where k is a constant. (9.16) 

Tie Lagrangian is therefore 

L = T- V = m((dr/dt)2 + r 2 (d(|Vdt)2)/2 + k/r. (9.17) 

Put r = u, and (f> = v, the generalized coordinates. We have, for the “u-equation” 

• • 

(d/dt)(3L0u) = (d/dt)(3L/3r) = (d/dt)(m(dr/dt)) = mcPr/dt 2 , (9.18) 

and 

8U8 u = dUd r = mr(dcjVdt) 2 — k/r 2 (9.19) 


Using Lagrange’s equatiorvof-motion for the u-coordinate, we have 
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m(d 2 r/dt 2 ) - mitckjj/dt) 2 + k/r 2 = 0 


(9.20) 


or 


m(cPr/dt 2 - r(d(jVdt) 2 ) = -k/r 2 . 

This is, as it should be, the Newtonian equation 

mass x acceleration in the r-direction = force in the r-direction. 

Introducing a second generalized coordinate, we have, for the “vequation” 

• • • 

(d/dt)(dUdv) = = (d/dt)(mr 2 cj)) 

•• • • 

= m(r 2 cj) + cj) 2r r), 


and 


(9.21) 


(9.22) 


<9U9v = <M|) = 0, 


(9.23) 


therefore 


so that 


m(r 2 cj) + 2r rcj)) = 0 
(d/dt)(mr 2 4>) = 0. 


(9.24) 


Integrating, we obtain 

mr 2 c|) = constant (9.25) 

showing, again, that the angular momentum is conserved. 

Tie advantages of using the Lagrangian method to solve dynamical problems stem from the 
fact that L is a scalarfundon of generalized coordinates. 
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9.3 The Hamilton equations 

The Lagrangian L is a function of the generalized coordinates and velocities, and 

the time: 

• • 

L=L(u,v,...;u,v,...;t). 


(9.26) 


If the discussion is limited to two coordinates, u and v, the total differential of L is 


• • • • 

dL = (dl_/du)du + (cUSv) dv + {dUdu)± + (5L75v)dv + {8Udt)± 

Consider the simplest case of a mass m moving along the x-axis in a potential, so that u = x 

• • 

and u=x=v x , then 


L=T-V=mvx2/2-V 


(9.27) 


and 


3L/dVx = mv x = px, the linear momentum. (9.28) 

• • 

In general, it is found that terms of the form dildu and 3L/3v are “momentum” terms; 

they are called generalized momenta, and are written 

• • 

dildu = pu, 3L/3v = pv, ..etc. (9.29) 

Such forms are not limited to “linear” momenta. 

The Lagrange equation 

(d/dt)(3LVdu) - 3LVdu = 0 (9.30) 

can be transformed, therefore, into an equation that involves the generalized momenta: 


(d/dt)(pu)-3L0u = O, or 
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3L/du = p u . 

Tine total differential of L is therefore 

• • • • 
dL = p u du + pvdv + p u du + pvdv + (3L/3t)dt. 

We now introduce an important function, the Hamiltonian function, H, defined by 

• • 

H = p u u + pvV-L, 


therefore 

• • • • 

dH = {p u du +udp u + pvdv + vdpv} - dL . 

It is not by chance that H is defined in the way given above. The definition permits the 

• • 

cancellation of the terms in dH that involve du and dv, so that dH depends only on du, dv, 
dp u , and dpv (and perhaps, t). We can therefore write 

H = f(u, v, pu, pv; t) (limiting the discussion to the two coordinates 
uandv). 

Tine total differential of H is therefore 


(9.31) 


(9.32) 


(9.33) 


(9.34) 


(9.35) 


dH = (3H/3u)du + (3H/3v)dv + (3H%)dp u + (3H/3pv)dpv + (3H0)dt. 

Comparing the two equations for dH, we obtain Hamilton’s equations-of-motion: 

• • 

3H/3u=-p u , 3H/3v=-pv, 

• • 

3H/dp u = u, 3H/3pv=v, 

and 


(9.36) 


(9.37) 

(9.38) 


3H 0=-3L/a 


(9.39) 


We see that 
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H = puU + pvV-(T-V). 

(9.40) 

If we consider a mass m moving in the (x, y)-plane then 


H = (mvx)v x + (mvy)v y -T + V 

(9.41) 

= 2(mv x 2 /2 + mv//2 ) - T + V 


=T + V, the total energy. 

(9.42) 


In advanced treatments of Analytical Dynamics, this form of the Hamiltonian is shown to have general 

validity. 

PROBLEMS 

9-1 Studies of geodesics — the shortest distance between two points on a surface — 
form a natural part of the Calculus of Variations. Show that the straight line 
between two points in a plane is the shortest distance between them. 

9-2 The surface generated by revolving the y-coordinate about the x-axis has an area 
2TcJyds where ds = {dx 2 + dy 2 } 1,2 Use Euler's variational method to show that 
the surface of revolution is a minimum if 

(dy/dx) = {(y 2 /a 2 ) - 1} 12 where a = constant. 

Hence show that the equation of the minimum surface is 

y = acosh{(x/a) + b} where b = constant, and y^O. 

9-3 The Principle of Least Time pre-dates the Calculus of Variations. The propagation of 
a ray of light in adjoining media that have different indices of refraction is found to 
be governed by this principle. A ray of light moves at constant speed v in a medium 
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(1) from a point A to a point Bo on the x-axis. At Bo, its speed changes to 
a new constant value V 2 on entering medium (2). The ray continues until it reaches a 
point C in (2). If the true path A Bo -» C is such that the total travel time of the 
light in going from A to C is a minimim, show that 

(vi/v 2 ) = xoflyc 2 + (d - xo) 2 ]/^ 2 + x4 ic /(d - xo), (Snell’s law) 
where the symbols are defined in the following diagram: 



The path A -»> B -» C is an arbitrarily varied path. 

94 Hamilton’s Principle states that when a system is moving under conservative forces 
the time integral of the Lagrange function is stationary. (It is possible to show that 
this Principle holds for non-conservative forces). Apply Hamilton’s Principle to the 
case of a projectile of mass m moving in a constant gravitational field, in the plane. 
Let the projectile be launched from the origin of Cartesian coordinates at time t = 0. 
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The Lagrangian is 

L = m((dx/dt) 2 + (dy/dt)2)/2 - mgy 
Calculate Ldt, and obtain Newton’s equations of motion 

d 2 y/dt 2 + g = 0 and d 2 x/dt 2 = 0. 

9-5 Reconsider the example discussed in section 9.2 from the point of view of the Hamiltonian of the 
system. Obtain H(r, (j>, p r , p (| ), and solve Hamilton’s equations of motion to obtain the results given in 
Eqs.9.21 and 9.25. 
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10 

CONSERVATION LAWS, AGAIN 

10.1 The conservation of mechanical energy 

If the Hamiltonian of a system does not depend explicitly on the time, we have 

H = H(u, v, pv, ...). (10.1) 

In this case, the total differential dH is (for two generalized coordinates, u and v) 

dH = (<3H/du)du + (<9H/<9v)dv + (<9H%)dp u + (5H%)dpv. (10.2) 

If the positions and the momenta of the particles in the system change with time under their mutual 
interactions, then H also changes with time, so that 

dH/dt = (dH/du)du/dt + (<9H/<9v)dv/dt + (<9H/dpu)dp7dt + (3H%)dp,/dt 

= (— p u u) + (~PvV) + (up u ) + (vpv) (1 0.3) 

= 0, using Hamilton’s equationsof-motion. (1 0.4) 

Integration then gives 

H = constant. (10.5) 

In any system moving under the influence of conservative forces, a potential V exists. In such systems, 
the total mechanical energy is H = T + V, and we see that it is a constant of the motion. 

102 The conservation of linear and angular momentum 


If the Hamiltonian, H, does not depend explicitly on a given generalized coordinate then the 
generalized momentum associated with that coordinate is conserved. For example, if H contains no 
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explicit reference to an angular coordinate then the angular momentum associated with that angle is 
conserved. Formally, we have 

dp/dt = -3H/3q , where p and q are the generalized momenta and 

coordinates. (10.6) 

Let an infinitesimal change in the jth-coordinate q be made, so that 

q^q + 6q, (10.7) 

then we have 

5H = (3H/dq)Sq. (10.8) 

If the Hamiltonian is invariant under the infinitesimal displacement Sq, then the generalized momentum 
p is a constant of the motion. The conservation of linear momentum is therefore a consequence of the 
homogeneity of space, and the conservation of angular momentum is a consequence of the isotropy 
ofspace. 

The observed conservation laws therefore imply that the choice of a point in space for the origin of 
coordinates, and the choice of an axis of orientation play no part in the formulation of the physical laws; 
the Laws of Nature do not depend on an “absolute space”. 



172 


11 

CHAOS 

The behavior of many non-linear dynamical systems as a function of time is found to be 
chaotic. The characteristic feature of chaos is that the system never repeats its past behavior. Chaotic 
systems nonetheless obey classical laws of motion which means that the equations of motion are 
deterministic. 

Poincare was the first to study the effects of small changes in the initial conditions on the 
evolution of chaotic systems that obey non-linear equations of motion. In a chaotic system, the erratic 
behavior is due to the internal, or intrinsic, dynamics of the system. 

Let a dynamical system be described by a set of first-order differential equations: 

dXl/dt = fl(Xl,X2,X3,...Xn) ( 11 . 1 ) 

dX 2 /dt= f 2 (xi ,X2,X3, . . -Xn) 


dXn/dt— fn(Xl ,X2,X3, . . .Xn) 

where the functions f n are functions of n-variables. 

Tine necessary conditions for chaotic motion of the system are 

1) the equations of motion must contain a non-linear term that couples several of the variables. 

A typical non-linear equation, in which two of the variables are coupled, is therefore 
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dxi/dt= axi + bx 2 + CX 1 X 2 + ... rxn, (a, d, c, ...r are constants) (1 1 .2) 

and 

2) the number of independent variables, n, must be at least three. 

The second condition is discussed later. 

The non-linearity often makes the solution of the equations unstable for particular choices of the 
parameters. Numerical methods of solution must be adopted in all but a few standard cases. 

1 1 .1 The general motion of a damped, driven pendulum 

The equation of a damped, driven pendulum is 

ml(d 2 0/dt 2 ) + kml(d0/dt) + mgsin0 = Acos(coDt) (1 1 .3) 

or 

(cP0/dt 2 ) + k(d0/dt) + (g/l)sin0 = (A/ml)cosM, (1 1 .4) 

where 0 is the angular displacement of the pendulum, I is its length, m is its mass, the resistance is 
proportional to the velocity (constant of proportionality, k), A is the amplitude and cod is the angular 
frequency of the driving force. 

Baker and Gollub in Chaotic Dynamics (Cambridge, 1 990) write this equation in the form 

(cPO/dt 2 ) + (1/q)(d0/dt) + sin0 = Ccos(co D t), (1 1 .5) 

where q is the damping factor. The low-amplitude natural angular frequency of the pendulum is unity, 
and time is dimensionless. We can therefore write 


the equation in terms of three first-order differential equations 

dco/dt = -(1/q)oo - sin0 + Ccos((|)) where (j> is the phase, 


( 11 . 6 ) 



174 


d0/dt=co, (11.7) 

and 

ck|)/dt = cod . (11.8) 

Tine three variables are (go, 0, (|>). 

The onset of chaotic motion of the pendulum depends on the choice of the parameters q, C, and god. 
The phase space of the oscillations is threedimensional: 



The 0 - go trajectories are projections of the spiral onto the 0 - go plane. 

The motion is sensitive to god since the non-linear terms generate many new resonances that occur 
when coc/conaiurai is a rational number. (Here, oo^ is the angular frequency of the undamped linear 
oscillator). For particular values of q and god, the forcing term produces a damped motion that is no 
longer periodic — the motion becomes chaotic. Periodic motion is characterized by cbsed orbits in 
the (0 - go) plane. If the damping is reduced considerably, the motion can become highly chaotic. 
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Tine system is sensitive to small changes in the initial conditions. Tine trajectories in phase 
space diverge from each other with exponential tim&dependence. For chaotic motion, the projection 
of the trajectory in (0, go, (j>) - space onto the (0 - go) plane generates trajectories that intersect. 
However, in the full 3 - space, a spiraling line along the (j>axis never intersects itself. We therefore see 
that chaotic motion can exist only when the system has at least a 3 - dimensional phase space. The 
path then converges towards the aflracforwithout self-crossing. 

Small changes in the initial conditions of a chaotic system may produoe very different 
trajectories in phase space. These trajectories diverge, and their divergence increases exponentially 
with time. If the difference between trajectories as a function of time is d(t) then it is found that logd(t) ~ 
At or 

d(t)~e* (11.9) 

where A > 0 - a positive quantity called the Lyapunov exponent. In a weakly chaotic system A « 0.1 
whereas, in a strongly chaotic system, A » 0.1 . 

112 The numerical solution of differential equations 
A numerical method of solving linear differential equations that is suitable in the present case is 
known as the Runge-Kutta method. The algorithm for solving two equations that are functions of 
several variables is: 

Let 

dy/dx = f(x, y, z) and dz/dx = g(x, y, z). (1 1 .10) 

If y = yo and z = zo when x = xo then, for increments h in xo , k in yo, and I in zo 
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the Runge-Kutta equations are 


ki = hf(xo, yo, zo) 

k 2 = hf(xb + h/2, yo + ki/2, zo + h/2) 
k3 = hf(xo + h/2, yo + k^2, zo + 12 / 2 ) 
t<4 = hf(xo + h, y 0 + k3, zo + b) 


h = hg(xo, yo, zo) 

b = hg(xo + h/2, yo + ki/2, zo + h/2) 
h = hg(xo + h/2, yo + fe/2, zo + h/2) 
U = hg(xo + h, y 0 + k3, zq + b) 


k - (ki + 2k2 + 2k3 + k4)/6 
and 

I = (h + 2b + 2b + U)/6. (11.11) 

The initial values are incremented, and successive values of tie x, y, and z are generated by iterations. 
It is often advantageous to use varying values of h to optimize the procedure. 

In the present case, 

f(x, y, z) -» f(t, 0, to) and g(x, y, z) g(co). 

As a problem, develop an algorithm to solve the non-linear equation 11.5 using the Runge- 
Kutta method for three equations (1 1 .6, 1 1 .7, and 1 1 .8). Write a program to calculate the necessary 
iterations. Choose increments in time that are small enough to reveal the details in the 0oo plane. 
Examples of non-chaotic and chaotic behavior are shown in the following two diagrams. 
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The parameters used to obtain this plot in the 0-co plane are : 

damping factor (1/q) = 1/5, 
amplitude (C) = 2, 
drive frequency (cod) = 0.7, and 
time increment, At =0.05. 

All the initial values are zero. 


The parameters used to obtain this plot in the 0-co plane are: 

damping factor (1/q) = 1/2, 
amplitude (C) = 1.15, 
drive frequency (cod) = 0.597, and 
time increment, At =1. 

Tie intial value of the time is 100. 
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12 

WAVE MOTION 

12.1 The basic form of a wave 

Wave motion in a medium is a collective phenomenon that involves local interactions among 
the particles of the medium. Waves are characterized by: 

1) a disturbance in space and time. 

2) a transfer of energy from one place to another, 
and 

3) a non-transfer of material of the medium. 

(In a water wave, for example, the molecules move perpendicularly to the velocity vector of the 

wave). 

Consider a kink in a rope that propagates with a velocity V along the +x-axis, as shown 
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Assume that the shape of the kink does not change in moving a small distance Ax in a short interval of 
time At. The speed of the kink is defined to be V = Ax/At. The displacement in the ydirection is a 
function of x and t, 

y = f(x, t). 

We wish to answer the question: what basic principles determine the form of the argument of the 
function, f ? For water waves, acoustical waves, waves along flexible strings, etc. the wave velocities 
are much less than c. Since y is a function of x and t, we see that all points on the waveform move in 
such a way that the Galilean transformation holds for all inertial observers of the waveform. Consider 
two inertial observers, observer #1 at rest on the x-axis, watching the wave move along the x-axis with 
constant speed, V, and a second observer #2, moving with the wave. If the observers synchronize 
their clocks so that ti = fe = to = 0 at xi = X 2 = 0, then 

X2 = Xl-Vt. 

We therefore see that the functional form of the wave is determined by the form of the Galilean 
transformation, so that 

y(x, t) =f(x— Vt), (12.1) 

where V is the wave velocity in the particular medium. No other functional form is possible! For 
example, 

y(x, t) = Asink(x- Vt) is permitted, whereas 
y(x,t)=A(x 2 +V 2 t)is not. 

If the wave moves to the left (in the-x direction) then 
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y(x,t)=f(x+Vt). (12.2) 

We shall consider waves that superimpose linearly. If, for example, two waves move along a 
rope in opposite directions, we observe that they “pass through each other"’. 

If the wave is harmonic, the displacement measured as a function of time at the origin, x = 0, is also 
harmonic: 

yo(0,t)=Acos(oat) 

where A is the maximum amplitude, and go = 2ttv is the angular frequency. 

Tine general form of y(x, t), consistent with the Galilean transformation, is 
y(x,t)=Acos{k(x-Vt)} 

where k is introduced to make the argument dimensionless (k has dimensions of 1/[length]). We then 
have 


yo(0, t) = Acos(kVt) = Acos(cot). 


Therefore, 


go = kV, the angular frequency, 


(12.3) 


or 


2ttv = kV, 


so that, 


k = 2 ttvA/ = 2tt/VT where T = 1/v, is the period. 


(12.4) 


Tie general form is then 


y(x, t) = A cos{(2ttA/T)(x- Vt)} 
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= Acos{(2ti/X,)(x - Vt)}, where X = VT is the wavelength, 

= Acos{(2ttx/a - 2itt/T)}, 

= Acos(kx - 2 tcI/T), where k = 2nlX, the “wavenumber'’, 

=Acos(kx-o)t), 

= A cos(oat - kx), because cos(-0) = cos(6). (1 2.5) 

For a wave moving in three dimensions, the diplacement at a point x, y, z at time t has the form 

i|)(x, y, z, t) = Acos(cot — k-r), (1 2.6) 

where |k| = 2nlX and r = [x, y, z]. 

12.2 The general wave equation 

An arbitrary waveform in one space dimension can be written as the superposition of two 
waves, one travelling to the right (+x) and the other to the left (-x) of the origin. The displacement is 
then 

y(x, t) = f(x— Vt) + g(x + Vt). (12.7) 

Put 

u = f(x- Vt) = f(p), and v = g(x + Vt) = g(q), 
then 

y=u+v. 

Now, 

3y/3x = du/3x + 3v/3x = (du/dp)(3p/3x) + (dv/dq)(3q/3x) 

=f(p)(ap/ax)+g'(q)(aq/ax). 
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Also, 

52y IM = (3/<9x){(du/dp)((9p/<9x) + (dv/dq)(<9q /ax)} 

= f(p)(<92p/<9x 2 ) + f"(p)(«9p/<9x)2 + g'(q)(«92q/(9x 2 ) + g"(q)(«9q/<9x)2. 

We can obtain the second derivative of y with respect to time using a similar method: 

32y/dt 2 = f (p)(^p/a 2 ) + f"(p)(dp/a) 2 + g'(q)(aq/a 2 ) + g"(q)(<9q/a)2. 

Now, 3p/5x = 1, 3q/5x = 1, 3p/<9t = -V, and 3q/dt = V, and all second derivatives are zero (V is a 
constant). We therefore obtain 

5 2 y/5x 2 =f"(p) + g"(q), 
and 

5 2 y/a 2 =f"(p)V 2 + g"(q)V 2 . 

Therefore, 

ay/a 2 =v2(ay/a< 2 ). 

or 

ay/a< 2 - (W 2 )(ay/a 2 )=o. ( 12 . 8 ) 

This is the wave equation in onedimensional space. For a wave propagating in threedimensional 
space, we have 

Sftp-{W){dh\>ldt 2 )=0, (12.9) 

the general form of the wave equation, in which ip(x, y, z, t) is the general amplitude function. 

12.3 The Lorentz invariant phase of a wave and the relativistic Doppler shift 

A wave propagating through space and time has a “wave function” 
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ip(x,y,z,t)=Acos(oat-k-r) 
where the symbols have the meanings given in 12.2. 

The argument of this function can be written as follows 

ip = Acos{(a)/c)(ct) - kr). (12.10) 

It was not until deBroglie developed his revolutionary idea of particle-wave duality in 1923-24 that the 
Lorentz invariance of the argument of this function was fully appreciated! We have 
x|j = Acos{[o >/c, k] T [ct, — r]} 

= Acos{K'E | ,} = Acos(|), where (j> is the “phase”. (12.11) 

deBroglie recognized that the phase <j> is a Lorentz invariant formed from the 4-vectors 

K M = [oo/c, k], the “frequency-wavelength” 4-vector, (12.12) 

and 

E,, = [ct, — r], the covariant “event” 4-vector. 

deBroglie’s discovery turned out to be of great importance in the development of Quantum Physics. It 
also provides us with the basic equations for an exact derivation of the relativistic Doppler shift. The 
frequency-wavelength vector is a Lorentz 4-vector, which means that it transforms between inertial 
observers in the standard way: 

K' 1 ’ = LK M , (12.13) 

or 
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The transformation of the first element therefore gives 

co'/c = y(oa/c) - (3yk x , (12.14) 

so that 

2ttv' = y2ttv - fiy ($ Tifk ) 


or 


v' = yv - Vy(v/c), (where go = 2ttv, V/c = |3, and c = vh) 

therefore 

v'=yv(1 -|3) 

or v'=(v/(1-|32) 15 )(1-|3) 

giving 

v'=v{(1-(3)/(1+|3F. (12.15) 

This is the relativistic Doppler shift for the special case of photons - we have Lorentz invariance in 
action. This result was derived in section 6.2 using the Lorentz invariance of the energy-momentum 4- 
vector, and the Planck-Einstein result E = hv for the relation between the energy E and the frequency v 
of a photon. The present derivation of the relativistic Doppler shift is independent of the Planck-Einstein 
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result, and therefore provides an independent verification of their fundamental equation E = hv for a 
photon. 

12.4 Plane harmonic waves 

The on&dimensional wave equation (12.8) has the solution 
y(t,x)=Acos(kx-cot), 

where go = kV and A is independent of both x and t. 

This form is readily shown to be a solution of (12.8) by direct calculation of its 2nd partial derivatives, 
and their substitution in the wave equation. 

The three-dimensional wave equation (12.9) has the solution 
ip(t, x, y, z) = i|)ocos{(kxX + k y y + kzz) - cot}, 
where go = |k|V, and k = [k x , k y , kz], the wave vector. 

The solution ip(t, x, y, z) is called a plane harmonic wave because constant values of the argument (k<x 
+ k y y + kzz) - Got define a set of planes in space— surfaces of constant phase: 
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It is often useful to represent a plane harmonic wave as the real part of the remarkable Cotes-Euler 
equation 

e p = cos0 + isin0, i = V— 1 , 

so that 

i|jocos((k-r) - cot) = R.P. ipoe^- 03 *). 

fine complex form is readily shown to be a solution of the threedimensional wave equation. 

12.5 Spherical waves 

For given values of the radial coordinate, r, and the time, t, the functions 

cos(kr- cot) and eM have constant values on a sphere of radius r. In order for the wave functions to 

represent expanding spherical waves , we must modify their forms as 

follows: 

(1/r)cos(kr-cot)and(1/r)e^ (kalongr). (12.16) 

These changes are needed to ensure that the wave functions are solutions of the wave equation. To 
demonstrate that the spherical wave (1/r)cos(kr- cot) is a solution of (12.9), we must transform the 
Laplacian operator from Cartesian to polar coordinates, 

V^zHV^e,# 

The transformation is 

WM + 32/<9y 2 + m z 2 - (1/r 2 p/<9r)(r 2 (a/dr)) + (1 /sinQ)(a/ao)(sinQ(a/ao)) 

+ (1/sin 2 0)(5 2 /a|) 2 )]. (12.17) 


This transformation is set as a problem. 
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If there is spherical symmetry, there is no angular-dependence, in which case, 

V2(r) = (Vr 2 )[dldc)[r 2 {dldi)) 

=m 2 + {2Jr){dldr). (12.18) 

We can check that 

ip=ipo(1/r)cos(kr-oat) 
is a solution of the radial form of (1 2.9), 

Differentiating twice, we find 

cftp/dr 2 = xpo[(-k 2 /r)cosu + (2k/r 2 )sinu + (2/r 3 )cosu], where u = kr- oat 
and 

c^ip/dt 2 = — xpo(o > 2 /r)cosu, oa = kV, 
from which we obtain 

(i/v 2 )a 2 i|va 2 -[a 2 ip/ar 2 + (2/r)&pa] = o. (12.19) 

12.6 The superposition of harmonic waves 

Consider two harmonic waves with the same amplitudes, ipo, travelling in the same direction, 
the x-axis. Let their angular frequencies be slightly different — oa ± 6oa with corresponding 
wavenumbers k ± 6k. Their resultant, v f ; , is given by 

\j/ =1 p 0 e^+^)x-(" +&o » 

+ip 0 e^-^) x -(“- &o » 

= ipoe^-^e^- 8 ^) + e^- 6 ^)] 


= i|j 0 e^-"’ti[2cos(6kx - 6oat)] 
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= AcoscJ), (12.20) 

where 

A = 2ipoe( kx -“ t ), the resultant amplitude, 
and 

<j) = Skx- Soot, the phase of the modulation envelope . 

The individual waves travel at a speed 

oo/k = v (|) , the phase velocity, (1 2.21 ) 

and the modulation envelope travels at a speed 

Soo/Sk = vg, the group velocity. (1 2.22) 

In the limit of a very large number of waves, each differing slightly in frequency from that of a neighbor, 
dk^O, in which case 

dco/dk=VG. 

For electromagnetic waves travelling through a vacuum, vg = v (|j = c, the speed of light. 

We shall not, at this stage, deal with the problem of the superposition of an arbitrary number of 
harmonic waves. 

12.7 Standing waves 

fine superposition of two waves of the same amplitudes and frequencies but travelling in 
opposite directions has the form 

'P = ipi + ip 2 = Acos(kx - cot) + Acos(kx + oat) 


= 2Acos(kx)cos(oot). 


(12.23) 
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This form describes a standing wave that pulsates with angular frequency go, associated with the time- 
dependent term COSGOt. 

In a traveling wave, the amplitudes of the waves of all particles in the medium are the same and their 
phases depend on position. In a standing wave, the amplitudes depend on position and the phases 
are the same. 

For standing waves, the amplitudes are a maximum when kx = 0, tc, 2tt, 3tt, ... 
and they are a minimum when kx = tt/2, 3ti/2, 5ti/2, ...(the nodes). 

PROBLEMS 

The main treatment of wave motion, including interference and diffraction effects, takes place 
in the second semester (Part 2) in discussing Electromagnetism and Optics. 

12-1 Ripples on the surface of water with wavelengths of about one centimeter are found 
to have a phase velocity v f|) = V(ak) where k is the wave number and a is a 
constant characteristic of water. Show that their group velocity is \fc = (3/2)v ([) . 

12-2 Show that 

y(x, t) = exp{x-vt} 

represents a travelling wave but not a periodic wa/e. 

1 2-3 Two plane waves have the same frequency and they oscillate in the z-direction; they 
have the forms 

ip(x, t) = 4sin{20t + (ttx/3) + tt}, and 


ip(y, t) = 2sin{20t + (ny/4) + tt}. 
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Show that their superposition at x = 5 and y = 2 is given by 
ip(t) = 2.48sin{20t- (tt/5)}. 

1 24 Express the standing wave y = Asin(ax)sin(bt), where a and b are constants as a 
combination of travelling waves. 

1 2-5 Perhaps the most important application of the relativistic Doppler shift has been, and 
continues to be, the measurement of the velocities of recession of distant galaxies 
relative to the Earth. The electromagnetic radiation associated with ionized calcium 
atoms that escape from a galaxy in Hydra has a measured wavlength of 4750 x 1 0- 10 m, 
and this is to be compared with a wavelength of 3940 x 1 0- 10 m for the same process 
measured for a stationary source on Earth. Show that the measured wavelengths 
indicate that the galaxy in Hydra is receding from the Earth with a speed v = 0. 1 87c. 
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13 

ORTHOGONAL FUNCTIONS AND FOURIER SERIES 

13.1 Definitions 

Two n-vectors 

An = [ai, a 2 , ...a n ] and B n = [bi, te, ...b n ] 
are said to be orthogonal if 

Xp=i,n]aibi = 0. (13.1) 

(Their scalar product is zero). 

Two functions A(x) and B(x) are orthogonal in the range x = a to x = b if 
J M A(x)B(x)dx=0. (132) 

fine limits must be given in order to specify the range in which the functions A(x) and B(x) are defined, 
fine set of real, continuous functions {(f>i(x), ^(x), ...} is orthogonal in [a, b] if 


l[ab] <j)m(x)<f) n (x)dx = 0 for m ^ n. (13.3) 

If, in addition, 

J[a,b] cpn^xjdx = 1 for all n, (13.4) 

the set is normal, and therefore it is said to be orthonormal. 
fine infinite set 

{cosQx, coslx, cos2x, ... sinOx, sinlx, sin2x, ...} (13.5) 

in the range [— tt, tt] of x is an example of an orthogonal set. For example, 

Jf-Tx. tt] cosx-cos2xdx = 0 etc., (1 3.6) 
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and 

^ 7I , 1T ]Cos 2 xdx^0=Ti,etc. 

This set which is orthogonal in any interval of x of length 2n, is of interest in Mathematics 
because a large class of functions of x can be expressed as linear combinations of the members of the 
set in the interval 2n. For example we can often write 

(|)(x) = CKpi + c$z + where the c’s are constants 
= aocosOx + aicoslx + a2Cos2x + ... 

+ bosinOx + bisinlx + b2Sin2x + ... (13.7) 

A large class of periodic functions , of period 2n, can be expressed in this way. When a function can be 
expressed as a linear combination of the orthogonal set 
{1, coslx, cos2x, ... 0 , sinlx, sin2x, ...} , 
it is said to be expanded in its Fourier series. 

13 2 Some trigonometric identities and their Fourier series 

Some of the familiar trigonometric identities involve Fourier series. For example, 

cos2x = 1-2sin 2 x (13.8) 

can be written 

sin 2 x=(1/2)-(1/2)cos2x 
and this can be written 

sin^ = {(1/2)cos0x + Ocoslx- (1/2)cos2x + 0cos3x + ... 


+ 0{sin0x + sinlx + sin2x + ...} 


(13.9) 
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-> the Fourier series ofsiifx. 

The Fourier series of cos 2 x is 

cos2x = (1/2) + (1/2)cos2x. (13.10) 

More complicated trigonometric identies also can be expanded in their Fourier series. For example, the 
identity 

sin3x=3sinx-4sin 3 x 

can be written 

sin 3 x = (3/4)sinx- (1/4)sin3x, (13.11) 

and this is the Fourier series of sink 
Tine terms in the series represent the “harmonics“of the function sink 
In a similar fashion, we find that the identity 
cos3x=4cos 3 x-3cosx 

can be rearranged to give the Fourier series of cosfr 

cos 3 x = (3/4) + (1/4)cos3x. (13.12) 

In general, a combination of deMoivre’s theorem and the binomial theorem can be used to 
write cos(nx) and sin(nx) (for n a positive integer) in terms of powers of sinx and cosx. We have 

cos(nx) + isin(nx) = (cosx + isinx) 1 (i = V-1 ) (deMoivre) (13.13) 

and 

(a + by = a n + na^b + (n(n-1)/2!) a^b 2 ...+b n . (13.14) 


For example, if n = 4, we obtain 



195 


cos^x = (1/8)cos4x + (1/2)cos2x + (3/8), (13.15) 

and 

sin 4 x = (1/8)cos4x-(1/2)cos2x + (3/8). (13.16) 

13.3 Determination of the Fourier coefficients of afunction 

If, in the interval [a, b], the function f(x) can be expanded in terms of the set 
{<Mx), 4 > 2 (x), ...}, which means that 

f(x) = X[fi,°°] ckKx), (13.17) 

where {cj>i(x), cj^x), ...} is orthogonal in [a, b], then the coefficients can be evaluated as follows: 

to determine the kth-coefficient, ck, multiply f(x) by (|>k(x), and integrate over the interval [a, b]: 

J[a,b] f(x)cpk(x)dx = J[a.b] ClCplCpkClX + ...|[a,b]Q«f^dX+ ... (13.18) 

= 0 + 0 + 7^0 +0... 

The integrals of the products c\hi\h in the range [-jt, jt] are all zero except for the case that involves 
We therefore obtain the kth-coefficient 

Ck = J[a, b] f(x)(f)k(x)dx / J[a,b] Cpk 2 (x)dx k = 1 , 2, 3,.. (13.19) 

13.4 The Fourier series of a periodic saw-tooth waveform 

In standard works on Fourier analysis it is proved that every periodic continuous function f(x) of 
period 2tt can be expanded in terms of {1 , cosx, cos2x, ...0, sinx, sin2x, ...}; this orthogonal set is said to 
be complete with respect to the set of periodic continuous functions f(x) in [a, b]. 
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Tine function has the following forms in the three intervals 
f(x) = (-2/tt)(x + tt) for -tc < x < -tt/2, 

= 2x/tt for-n/2 < x < tt/2, 

and 

= (-2/tt)(x - tt) for tt/2 < x < tc. 
fine periodicity means that f(x + 2n) = f(x). 

fine function f(x) can be represented as a linear combination of the series {1, cosx, cos2x, 
...sinx, sin2x, ...}: 

f(x) = aocosOx + aicoslx + a2Cos2x + ...akcoskx + ... 

+ bosinOx + bisinlx + b2sin2x + ...bkSinkx + ... (13.20) 

Tine coefficients are given by 

ak = Jpn,Tx] coskx f(x)dx / |^, Tt ]Cos 2 kxdx = 0, (f(x) is odd, coskx is even, and 


[— tc, tt] is symmetric about 0), (1 3.21 ) 
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and 

bk = W] sinkxf(x)dx/l^Tu] sin 2 kxdx + 0, 

= (-2/tt)(x + Tc)sinkxdx + J^ncj (2x/Ti)sinkxdx 

+ .Utt] (— 2/tc)(x - 7i)sinkxdx } 

={8/(n:k) 2 }sin(kn/2). (13.22) 

Tine Fourier series of f(x) is therefore 

f(x) = (8/Tr 2 Xsinx-(1/32)sin3x+(1/^)sin5x -(1/7 2 )sin7x+ ...}. 

Tine above procedure can be generalized to include functions that are not periodic. The sum 
of discrete Fourier components then becomes an integral of the amplitude of the component of angular 
frequency go = 2ttv with respect to go. This is a subject covered in the more advanced treatments of 
Physics. 


PROBLEMS 

1 3-1 Use deMoivre’s theorem and the binomial theorem to obtain the Fourier expansions: 

1) cosfy = 3/8 + (1/2)cos2x + (1/8)cos4x, 
and 

2) siifx = 3/8 - (1/2)cos2x + (1/8)cos4x. 

Plot these components (harmonics) and their sums for-u < 0 < tt. 


1 3-2 Use the method of integration of orthogonal functions to obtain the Fourier series of 
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problem 13-1 ; you should obtain the same results as above! 

1 3-3 Show that 1 ) if f(x) = - f(— x), only sine functions occur in the Fourier series for f[x), 
and 


2) if f(x) = f(-x), only cosine functions occur in the Fourier series for f(x). 

134 The Fourier series of a function f(t) that is a periodic repetition outside (-T, T), of the shape inside, 
with period 2n is often written in the form 

f(t) = (ao/2) + X[n=i ,oo]{ancos(nrtl/r) + bnSin(nrct/T)}, 
where 

a n = (1 /T)J h x t] f(t)cos(nTrt/T)dt, 

and 

b n = (1 /T)J^tti f(t)sin(nTtt/T)dt. 

If f(t) is a periodic square-wave: 
f(t) = 3 for 0 < t < 5pis 

= 0 for 5 < t < 10us, with period 2T= lOus 


< -10 -5 

obtain the Fourier series: 


f(t) A 

3 - 

0 

0 


5 


10 


t 


f(t) = (3/2) +(3/TT)X[n=i«] [(1 - cosnTc)/n]sin(mTt/5)). 
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Compute this series for n = 1 to 5 and -5 < t < 5, and compare the truncated 
series with the exact waveform. 

1 3-5 It is interesting to note that the series in 1 34 converges to the exact value f(t) = 3 
at the value t = 5/2 us, so that 

3 = (3/2) + (3/n)X[n=i, ~f(1 -cosnTi)/n]sin(nTt/2). 

Use this result to obtain the important Gregory-Leibniz infinite series : 

(ti/4) = 1 - (1/3) + (1/5) - (1/7) + ... 
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Appendix A 

Solving ordinary differential equations 

Typical dynamical equations of Physics are 

1 ) Force in the x-direction = mass x acceleration in the x-direction with the mathematical 
form 

F x =ma x =md 2 x/dt 2 , 

and 

2) The amplitude y(x, t) of a wave at (x, t), travelling at constant speed V along 
the x-axis with the mathematical form 

(1/V 2 )5 2 ya 2 -5 2 y/ax 2 =0. 

Such equations, that involve differential coefficients, are called differential equations. 

An equation of the form 

f(x,y(x),dy(x)/dx;ar) = 0 (A.1) 

that contains 

i) a variable y that depends on a single, independent variable x, 

ii) a first derivative dy(x)/dx, 


iii) constants, a r , 
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is called an ordinary (a single independent variable) differential equation of the first order (a first 
derivative, only). 

An equation of the form 

f(xi, X2, ...Xn, y(xi, X2, ...Xn), 3y/3xi, (9y/5x 2 , ...dy/3xn; ^ySxi 2 , &yld&, 

...cPyldx n 2 ; <9y/5xi n , sySx^ 1 , ...<9y/3xn n ; ai, a 2 , ...a) = 0 (A.2) 

that contains 

i) a variable y that depends on n-independent variables xi,X 2 , ...Xn, 

ii) the 1st- 2nd-, ...nth-order partial derivatives: 

3y/3xi, ...52y/axi 2 , ...<9y/<9xi n , ..., 
and 

iii) r constants, ai,a 2 ,...a r , 

is called a partial differential equation of the nthorder. 

Some of the techniques for solving ordinary linear differential equations are given in this appendix. 

An ordinary differential equation is formed from a particular functional relation, f(x, y; ai, a 2 , 
...a n ) that involves n arbitrary constants. Successive differentiations of f with respect to x, yield n 
relationships involving x, y, and the first n derivatives of y with respect to x, and some (or possibly all) of 
the n constants. There are (n + 1) relationships from which the n constants can be eliminated. The 
result will involve dy/dx 11 , differential coefficients of lower orders, together with x, and y, and no arbitrary 
constants. 

Consider, for example, the standard equation of a parabola: 
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y 2 - 4ax. = 0, where a is a constant. 

Differentiating, gives 

2y(dy/dx)-4a = 0 

so that 

y - 2x(dy/dx) = 0, a differential equation that does not contain the constant a. 

As another example, consider the equation 

f(x, y, a, b,c) = 0=x 2 +y 2 + ax+by+c=0. 

Differentiating three times successively, with respect to x, gives 

1 ) 2x + 2y(dy/dx) + a + b(dy/dx) = 0, 

2) 2 + 2{y(d 2 y/dx 2 ) + (dy/dx) 2 } + b(d 2 y/dx 2 ) = 0, 
and 

3) 2{y(d 3 y/dx 3 ) + (d2y/dx 2 )(dy/dx)} + 4(dy/dx)(d2y/dx 2 ) + b^/dx 3 ) = o. 

Eliminating b from 2) and 3), 

(d3y/dx 3 ){1 + (dy/dx) 2 } = (dy/dx)(d2y/dx 2 ) 2 . 

Tine most general solution of an ordinary differential equation of the nthorder contains n 
arbitrary constants. The solution that contains all the arbitrary constants is called the complete 
primative. If a solution is obtained from the complete primative by giving definite values to the constants 
then the (non-unique) solution is called a particular integral. 

Equations of the Istorderand degree. 

The equation 
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M(x, y)(dy/dx) + N(x, y) = 0 (A.3) 

is separable if M/N can be reduced to the form fi(y)/f 2 (x), where fi does not involve x, and f 2 does not 
involve y. Specific cases that are met are: 

i) y absent in M and N, so that M and N are functions of x only; Eq. (A.3) then can be written 
(dy/dx) = —(M/N) = F(x) 

therefore 


y = {F(x)dx + C, where C is a constant of integration. 


ii) x absent in M and N. 

Eq. (A.3) then becomes 

(M/N)(dy/dx) = — 1 , 


so that 


F(y)(dy/dx) =-1 , (M/N = F(y)) 

therefore 

x=-|F(y)dy+C. 

iii) x and y present in M and N, but the variables are separable. 
Put M/N = f(y)/g(x), then Eq. (A.3) becomes 
f(y)(dy/dx)+g(x)=0. 

Integrating overx, 

jf(y)(dy/dx)dx + jg(x)dx = 0. 


or 
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J«dy+Jg(x)dx=0. 

For example, consider the differential equation 
x(dy/dx) + coty=0. 

This can be written 

(siny/cosy)(dy/dx) + 1/x = 0. 

Integrating, and putting the constant of integration C = InD, 

J(siny/cosy)dy + J(1/x)dx = InD, 
so that 

— ln(cosy) + Inx = InD, 
or 

ln(x/cosy) = InD. 

The solution is therefore 

y=cos- 1 (x/D). 

Exact equations 
Tine equation 

ydx + xdy = 0 is said to be exact because it can be written as 
d(xy) =0, or 
xy= constant. 

Consider the non-exact equation 
(tany)dx+(tanx)dy=0. 
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We see that it can be made exact by multiplying throughout by cosxcosy, giving 
sinycosxdx + sinxcosydy = 0 (exact) 
so that 

d(sinysinx) = 0, 
or 


sinysinx = constant. 

The term cosxcosy is called an integrating factor. 

Homogeneous differential equations. 

A homogeneous equation of the nth degree in x and y is such that the powers of x and y in 
every term of the equation is n. For example, x 2 y + 2xy 2 + 3y 3 is a homogeneous equation of the third 
degree. If, in the differential equation M(dy/dx) + N = 0 the terms M and N are homogeneous functions 
of x and y, of the same degree, then we have a homogeneous differential equation of the 1 st order and 
degree. The differential equation then reduces to 
dy/dx = — (N/M) = F(y/x) 

To find whether or not a function F(x, y) can be written F(y/x), put 

y=vx. 

If the result is F(v) (all x’s cancel) then F is homogeneous. For example 

dy/dx = (x 2 + y 2 )/2x 2 -»> dy/dx = (1 + v 2 )/2 = F(v), therefore the equation is 


homogeneous. 
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Since dy/dx -»> F(v) by putting y = vx on the right-hand side of the equation, we make the same 
substitution on the left-hand side to obtain 
v+x(dv/dx)=(1 +v 2 )/2 

therefore 

2xdv=(1 +v 2 -2v)dx. 

Separating the variables 

2dv/(v- 1 f = dx/x, and this can be integrated. 

Linear Equations 
Tine equation 

dy/dx +M(x)y=N(x) 

is said to be linear and of the 1st order. An example of such an equation is 
dy/dx +(1/x)y=x 2 . 

This equation can be solved by introducing the integrating factor, x, so that 
x(dy/dx)+y=x 3 , 

therefore 


giving 


(d/dx)(xy)=x 3 , 


xy = x 4 /4 + constant. 


In general, let R be an integrating factor, then 
R(dy/dx) + RMy=RN, 
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in which case, the left-hand side is the differential coefficient of some product with a first term R(dy/dx). 
Tine product must be Ry! Put, therefore 

R(dy/dx) + RMy= (d/dx)(Ry) = R(dy/dx) + y(dR/dx). 

Now, 

RMy=y(dR/dx), 

which leads to 

jM(x)dx = JdR/R = InR, 


or 

R=exp{fM(x)dx}. 

We therefore have the following procedure: to solve the differential equation 
(dy/dx) + M(x)y=N(x), 

multiply each side by the integrating factor exp{fM(x)dx}, and integrate. For example, let 
(dy/dx) + (1/x)y = x 2 , 

so that 

{M(x)dx = J(1/x)dx = Inx and the integrating factor is exp{lnx} = x:. We therefore obtain the 

equation 


x(dy/dx) + (1/x)y=x 3 , 
deduced previously on intuitive grounds. 
Linear Equations with Constant Coefficients. 


Consider the 1st order linear differential equation 
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po(dy/dx) + piy = 0, where po, pi are constants. 

Writing this as 

po(dy/y) + pi dx = 0, 

we can integrate term-by-term, so that 
polny + pix = constant, 

therefore 

lny=(-pi/po)x+ constant 
= (-pi/p 0 )x + InA, say 

therefore 

y=Aexp{(-pi/po)x}. 

Linear differential equations with constant coefficients of the 2nd order occur often in Physics. They are 
typified by the forms 
po(d 2 y/dx 2 ) + pi(dy/dx) + p 2 y = 0. 

Tine solution of an equation of this form is obtained by following the insight gained in solving the 1st 
order equation!. We try a solution of the type 

y=Aexp{mx}, 

so that the equation is 
Aexp{mx}(pom 2 + pm + P 2 ) = 0. 

If mis a root of 


pom 2 + pim + P2 = 0 
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then y = Aexp{mx} is a solution of the original equation for all values of A. 

Let the roots be a and (3. If a + |3 there are two solutions 
y = Aexp{ax }and y = Bexp{|3x.}. 

If we put 

y=Aexp{ax} + Bexp{|3x} 

in the original equation then 

Aexp{ax}(poa 2 + pia + p 2 ) + Bexp{|3x}(po[3 2 + pi|3 + p 2 ) = 0, 
which is true as a and (3 are the roots of 

pom 2 + pirn + p 2 = 0, (called the auxilliary equator) ) 

The original equation is linear, therefore the sum of the two solutions is, itself, a (third) solution. The third 
solution contains two arbitrary constants (the order of the equation), and it is therefore the general 
solutbn. 

As an example of the method, consider solving the equation 
2(d 2 y/dx 2 ) + 5(dy/dx) + 2y = 0. 

Put y = Aexp{mx }as a trial solution, then 
Aexp{mx}(2m 2 + 5m + 2) = 0, so that 

m = -2 or -1/2, therefore the general solution is 
y = Aexp{-2x} + Bexp{(-1/2)x}. 

If the roots of the auxiliary equation are complex, then 
y = Aexp{p + iq}x + Bexpjp - iq}x, 
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where the roots are p ± iq ( p, q e R). 

In practice, we write 

y = exp{px}[Ecosqx + Fsinqx] 
where E and F are arbitrary constants. 

For example, consider the solution of the equation 
d2y/dx 2 -6(dy/dx) + 13y=0, 
therefore 


m 2 -6m + 13 = 0, 


so that 


m = 3±i2. 


We therefore have 


y = Aexp{(3 + i2)x} + Bexp{3- i2)x} 

= exp{3x}(Ecos2x + Fsin2x). 

Tbe general solution of a linear differential equation with constant coefficients is the sum of a particular 
integral and the complementary function (obtained by putting zero for the function of x that appears in 
the original equation). 



211 


BIBLIOGRAPHY 

Those books that have had an important influence on the subject matter and the style of this 
book are recognized with the symbol *. I am indebted to the many authors for providing a source of 
fundamental knowledge that I have attempted to absorb in a process of continuing education over a 
period of fifty years. 

General Physics 

*Feynman, R. P., Leighton, R. B., and Sands, M., The Feynman Lectures on Physics, 3 vols., 
Addison-Wesley Publishing Company, Reading, MA (1964). 

*Joos, G., Theoretical Physics, Dover Publications, Inc., New York, 3rd edn (1986). 

Lindsay, R. B., Concepts and Methods of Theoretical Physics, Van Nostrand Company, Inc., New 
York (1952). 

Mathematics 

Armstrong, M. A., Groups and Symmetry, Springer-Verlag, New York (1 988). 

*Caunt, G. W., An Introduction to Infinitesimal Calculus, The Clarendon Press, Oxford (1949). 

*Courant R., and John F., Introduction to Calculus and Analysis, 2 vols., John Wiley & Sons, New York 
(1974). 

Kline, M., Mathematical Thought from Ancient to Modem Times, Oxford University Press, Oxford 
(1972). 



212 


*Margenau, H., and Murphy, G. M., The Mathematics of Physics and Chemistry, Van Nostrand 
Company, Inc., New York, 2nd edn (1956). 

Minsky, L, An Introduction to Linear Algebra, Dover Publications, Inc, New York (1982). 

*Piaggio, H. T. H, An Elementary Treatise on Differential Equations, G. Bell & Sons, Ltd, London 
(1952). 

Samelson, H, An Introduction to Linear Algebra, John Wiley & Sons, New York (1 974). 

Stephenson, G, An Introduction to Matrices, Sets and Groups for Science Students Dover 
Publications, Inc, New york (1986). 

Yourgrau, W, and Mandelstam, S, Variational Principles in Dynamics and Quantum Theory, Dover 
Publications, Inc, New York 1979). 

Dynamics 

Becker, R. A, Introduction to Theoretical Mechanics, McGraw-Hill Book Company, Inc, New York 
(1954). 

Byerty, W. E, An Introduction to the Use of Generalized Coordinates in Mechanics and Physics, Dover 
Publications, Inc, New York (1965). 

Kilmister, C. W, Lagrangian Dynamics: an Introduction for Students Plenum Press, New York (1967). 
*Ramsey, A. S, Dynamics Part I, Cambridge University Press, Cambridge (1 951 ). 

*Routh, E. J, Dynamics of a System of Rigid Bodies, Dover Publications, Inc, New York (1960). 



213 


Whittaker, E. T., A Treatise on the Analytical Dynamics of Particles and Rigid Bodies, Cambridge 
University Press, Cambridge (1961). This is a classic work that goes well beyond the level of the 
present book. It is, nonetheless, well worth consulting to see what lies ahead! 

Relativity and Gravitation 

*Einstein, A. ., The Principle of Relativity, Dover Publications, Inc., New York (1 952). A collection of 
original papers on the Special and General Tlieories of Relativity. 

Dixon, W. G., Special Relativity, Cambridge University Press, Cambridge (1978). 

French, A. P., Special Relativity, W. W. Norton & Company, Inc., New York (1 968). 

Kenyon, I. R., General Relativity, Oxford University Press, Oxford (1990). 

Lucas, J. R., and Hodgson, P. E., Spacetime and Electromagnetism, Oxford University Press, Oxford 
(1990). 

*Ohanian, H. C., Gravitation and Spacetime, W. W. Norton & Company, Inc., New York (1 976). 
*Rindler, W., Introduction to Special Relativity, Oxford University Press, Oxford, 2nd edn (1991). 

Rosser, W. G. V., Introductory Relativity, Butterworth &Co. Ltd., London (1967). 

Non-Linear Dynamics 

*Baker, G. L, and Gollub, J. P., Chaotic Dynamics, Cambridge University Press, Cambridge (1991). 
Press, W. H., Teukolsky, S. A., Vetteriing W. T., and Flannery, B. P., Numerical Recipes in 
C, Cambridge University Press, Cambridge 2nd edn (1992). 


Waves 



214 


Crawford, F. S., Waves, (Berkeley Physics Series, vol 3), McGraw-Hill Book Company, Inc., New York 
(1968). 

French, A. P., Vibrations and Waves, W. W. Norton & Company, Inc., New York (1971). 

General reading 

Bronowski, J., The Ascent of Man, Little, Brown and Company, Boston (1973). 

Calder, N., Einstein’s Universe, fine Viking Press, New York (1979). 

Davies, P. C. W., Space and Time in the Modem Universe, Cambridge University Press, Cambridge 
(1977). 

Schrier, E. W., and Allman, W. F., eds., Newton at the Bat, Charles Scribner’s Sons, New York (1984). 



